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INTRODUCTION 


It is a mystery that APL is more than 20 years old and there are 
no APL textbooks which treat the reader as if he or she has some 
understanding of the language. The introductory APL textbooks 
available are excellent at accomplishing their objectives. However, 
they leave the novice APL programmer stranded in the real world. The 
novice APLer has the tools but not the technigues, the knowledge but 
not the experience. 


This book picks up where introductory APL textbooks leave off. Its 
goal is to build your experience quickly by exposing you to 
applications of APL. This is accomplished by presenting real world 
problems and their APL solutions. 


Most sections of the book begin with the presentation of a problem. 
You should read the problem and formulate a solution to it, given 
your knowledge of APL. Then read on. The problem is followed by a 
"good" APL solution. Compare it to yours. If different, learn from 
the differences. 


Each chapter is followed by a set of problems. The purpose of the 
problems is to confirm your understanding of the material presented 
in the chapter. You will reinforce that understanding by working on 
the problems. The solutions to the problems are in the back of the 
book. 


Some of the most valuable material in the book is presented as 
utility function solutions to problems. Therefore, you should at 
least scan the problems and solutions after reading each chapter, 
even if you feel you need no reinforcement. 


The book assumes you understand the APL primitive functions. If you 
encounter a primitive function with which you are unfamiliar, look it 
up in an introductory APL textbook. Though the book does not assume 
you are uSing any particular implementation of APL, it does make 
specific references to three versions: APL2, APL*PLUS, SHARP APL. 
APL2 is a product of IBM; APL*PLUS is the trademark of a product of 
STSC, Inc.; SHARP APL is the trademark of a product of I. P. Sharp 
Associates, Ltd. 


If you are using a different implementation of APL, the material in 
the book will still be pertinent. Only one primitive function is 


-j]- 


INTRODUCTION 


assumed which you may not have: replicate (/). If your version of 
APL supports compression but not replicate, i.e. generates a DOMAIN 
ERROR on 3/4, you will need to substitute your own replicate 
function, say REPL, whenever replicate is used. The listing of one 
such REPL function is included at the end of this Introduction. 


Most experienced APL programmers collect a set of their favorite 
utility functions. These utility functions are used to increase 
programmer productivity by solving the same problem over and over 
again. This book contains and describes more than 150 commented 
utility functions. These functions are available on a floppy disk. 
See the Postscript at the end of the book. 


Notice the workspace ID CWSID:) displayed above the header of the 
REPL function below. The WSID refers to the name of the workspace in 
which the REPL function may be found. This convention is used 
throughout the book. Every function for which a WSID is provided is 
available on floppy disk. 


In several sections of the book, you are asked to imagine extensions 
to the APL language. Each extension is then implemented via a 
utility function. These imaginary extensions to APL are intended as 
instructive and mnemonic devices to help you to quickly understand 
and remember the definition of the utility function. Please do not 
misinterpret my intent. I do not seek to have these extensions 
implemented in the current versions of APL. In some cases, the 
extensions are half-baked or are inconsistent with existing APL 
conventions. No matter. Use them as they are intended. Allow 
yourself to imagine the extension and then view the utility function 
as the implementation of that extension. 


There is much emphasis in the book on efficiency considerations. A 
chapter is devoted to the topic. In addition, relative efficiencies 
of alternative algorithms are considered throughout the book. 
Emphasis is placed upon efficiency because of its importance. At an 
introductory level of APL, you can concentrate on the conciseness of 
APL and on its elegance. But in the real world, the practical APL 
programmer must take a blue collar approach. 


Sometimes a concise and elegant algorithm requires much more 
processing time than a somewhat more complicated algorithm. The 
difference may be significant enough to make an application feasible 
or infeasible depending upon the algorithm chosen. However, 
efficiency does not need to come at the cost of clarity. If comments 
are used generously and subfunctions used judiciously, you can have 
the best of both worlds: fast, readable functions. 


I solicit comments and suggestions about the topics, presentation and 
utility functions contained herein. In fact, if you make a 
suggestion that is incorporated into the text or utility functions of 
the next edition of the book, you will receive a free copy of the 
current version of the utility functions. 
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(WSID: UTILITY] 
V R¢B REPL V;sIsN;P;T;OI0 
Aa Emulates the replicate function (R¢B/V), where B 
A may be non-Boolean. Works for scalar/vector right 
A argument only. 
A Branch if right argument not a singleton: 
»(1#Nex/oeV)eL1 
R€(+/B)ovV 
>0 
A Branch if left argument not a singleton: 
L1:2(1#x/oB)oeL2 
R¢,&(B,NJoeV 
>0 
A Origin O logic is simpler: 
L2:01T0¢0 
A Flag nonzero replication factors: 
PexB 
A Indices into V of values to replicate: 
NeoI¢P/iN 
A Indices into result of starts of runs: 
T<+\P/B 
A All-zero vector with length of result: 
R€(71TT)00 
A Insert Ist differences for subsequent +\: 
RINo0,TI¢I-Noe0,I 
A Replicate selected elements of V: 
R¢VL+\R] 
V 


Chapter 1 


LIMBERING UP 


The purpose of this chapter is to give you an opportunity to 
crack your knuckles and stretch you muscles on some APL problems. If 
you have not used APL for awhile, you will want to spend some time 
solving these problems. The effort will put your mind in the proper 
APL orientation to get the most out of the book. If the solutions 
are different from your own, spend some time studying them. Review 
any primitive functions with which you are unfamiliar. 


If you use APL daily and the problems seem simple to you, skip this 
chapter altogether. (Solutions on pages 320 to 323). 


1. What expression will change the value 645 in the vector AMOUNT to 
845? 


2. What expression will return the scalar 1 if all elements of the 
numeric vector PREMS are between 100 and 500, and will return the 
scalar O otherwise? 


3. What expression will return the number of elements in the numeric 
vector WEIGHT which are approximately equal to 24? 
"Approximately" means the numbers are rounded to the nearest 
integer before comparing to 24. 


4. What expression will return the number of elements in the matrix 
MAT? 
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5, 


10. 


11. 


Given a variable ANS which represents a numeric scalar (say 56.5), 
what expression will return the character vector 'ANSWER IS 56.5 
YEARS’ ? 


What expression will cause the character vector NAME to be 
catenated as a new row of the character matrix NAMES, assuming 
the number of elements in NAME is less than or equal to the 
number of columns in NAMES? 


What is the effect of the expression, %oe12 1? 
A. Proceed to the next statement 
B. Proceed to line 12 
C. Proceed to line 1 
D. Exit the function 


E. RANK ERROR 


Given an integer vector V of length 2xN, construct an N-element 
vector R by adding the odd elements of V (1, 3, 5, ...) to the 
even elements (2, 4, 6, ...) times 256. 


What is the result of L/t0 and why? 


What is the meaning of -\VECTOR? 


How do you resume execution of a function after an error has 
occurred and been corrected? 


Chapter 1 LIMBERING UP 
12. What happens when closing function definition mode after editing 
the header of a suspended function? 


13. What is the meaning of AA4VECTOR? 


14.a. What expression describes the shape of the result of A+.xB? 


b. If either argument is a scalar? 


15.a. What expression describes the shape of the result of Ao.>B? 
b. If either argument is a scalar? 


16. What system function can be used to determine the amount of CPU 
time consumed by an APL expression? 


17. What expressions may be used to display the character vector 
PROMPT and to allow the user to enter a response (R) on the same 
line as, and following, the display of PROMPT? 


18.a. What expression will construct a character matrix which will 
generate N blank lines when displayed? 
b. What expression will construct a character vector which will 


produce the same effect? 


19. What expression will cause all variables in the workspace to be 
erased? 
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20.a. What would you type before running the function MODEL to cause 
the computer to stop before executing each of the lines 12 and 14? 


b. The following is a partial display of the function MODEL: 


[11] T+¢A+2 
[12] Q¢INTERPOLATE T 
[13] T¢T,Q 


By executing MODEL with the stops specified above, the value of T 
after the first stop is 6 11 13 and after the second stop is 6 11 
10. What corrective action would you take? 


21. What expression will return the number of lines in the function 
CALC? 


22. Given two vectors, V1 and V2, an INDEX ERROR is signalled by the 
last of the following expressions: 


IND¢V11V2 
GOOD¢IND<oeV1 
K¢(V1x1leV1)[GOOD/IND] 


Why? 


Chapter 2 


BRANCHING AND LOOPING 


Branching in APL is a paradox. The definition of the branch 
function (>) is simple but its application is not. In this chapter 
we discuss applications of the branching function for: conditional 
branching, multi-target branching and looping. Finally, the 
efficiency considerations of looping in APL are discussed. 
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PROBLEM: Branch to the line labeled CALC if the value of the 
variable X is greater than 5. 


TOPIC: Conditional Branching 


This problem requires a conditional branch statement. If the 
condition (X>5) is true, you want the program’s flow of execution to 
proceed to the line labeled CALC. If untrue, you want to continue at 
the next statement. The conditional branch statement can be 
expressed in many ways, the following being some of the more typical: 


1. %CX>5)/CALC 4. »CALCXLX>5 7. 2%»CXS5) L/CALC 
2. 2(5>5)eCALC 5. %CALCIiUX>5 8. ®*CALC UNLESS X<5 
3. %2CX>5) TCALC 6. %®CALC IF X>5 


While all of these expressions appear to be adequate, there exist 
subtle differences between them. Algorithm 1 (/) is the most 
commonly used conditional branching algorithm. Algorithm 2 (o) is 
the fastest. Algorithm 3 (tT) is a graphic complement to algorithm 7 
(1). Algorithms 4 (xl) and 5 (ft) are more readable than algorithms 
1, 2 and 3 since the word "IF" may be read in place of the xl or [v. 
However, Xt does not work in index origin O and [ut does not allow 
branching to line 0 (i.e. exiting the function) in origin 1. 
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Algorithm 6 (IF) is the most readable algorithm but requires the 
existence of the subfunction IF (which is a problem if your function 
must be self-contained) and is slightly slower than the other 
algorithms. 


Algorithms 7 (1) and 8 (UNLESS) require the logical negation of the 
condition and so may be read as "unless". They may be used when the 
condition is expressed in such a way that the opposite condition 
requires the branch (e.g. »~(ME€MVEC)JAPPEND instead of 
*(*ME€MVEC)/APPEND ). Algorithm 8 (UNLESS) has the same slight 
disadvantages of algorithm 6 (CIF). 


Given such a variety of conditional branching algorithms, which 
should you use? I prefer to use IF and UNLESS when extreme 
efficiency and self-containment are unnecessary (most of the time). 
Otherwise, I use 09 and 1. Whichever algorithm you use, be 
consistent. APL code is easier to read when conventions are used 
consistently. 
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PROBLEM: Branch to the line labeled CALC if the value of the 
Variable X is 4, to ENTER if X is 7, to STOP if X is 9 and 
to LOOP if X is 10. 


TOPIC: Multi-target Branching 


This problem requires a multi-target branch statement. You want the 
program’s flow of execution to jump to one of four different 
locations within the program depending upon the value of the variable 
X. The branch statement can be constructed in many ways, the 
following being two of the more typical: 


1. *9CX= 4 7 9 10)/CALC,ENTER,STOP, LOOP 
2. %(CALC,ENTER,STOP,LOOP)J[4 7 9 10 UX] 


In general, the first algorithm (/) is used unless the branch 
variable (X) is an index value (1, 2, 3,...) which corresponds to the 
index of the desired label in the list of labels. Then indexing ([]) 
is used. For example, if the problem is restated such that X will 
have the value 1, 2, 3 or 4, then use the expression: 


*(CALC,ENTER,STOP, LOOP) [X] 
Notice that the first algorithm (/) actually causes a branch to one 


of five locations, not four. The fifth location is the next 
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statement and is reached when the condition is untrue for all values 
supplied (e.g. if X is 6). This bonus branch location is not 
available when using the second algorithm ([1]) since an invalid 
branch value (e.g. if X is 6) causes an INDEX ERROR. You may avoid 
the INDEX ERROR by including an additional label to which the branch 
should take place if there is no match: 


*(CALC,ENTER,STOP,LOOP,OTHER)[4 7 9 10 UX] 


The additional label not only prevents an INDEX ERROR but has a 
possible advantage over the first algorithm (/) in that you are not 
forced to drop through to the following statement when there is no 
match. 


Both algorithms cause a branch to the label corresponding to the 
first true condition. In the above example, the conditional 
expression (X= 4 7 9 10) may have at most one true condition. 
However, if the expression is rewritten (e.g. X=4 7 9 10), there may 
be more than one true condition (e.g. if X€8), in which case only the 
first true condition will be honored. 


When using multi-target branching algorithms, it is easy to overlook 
the fact that the expression 


9(L1,L2,L3,L4,L5,L6,L7)(TYPE] 
requires 8 primitive functions (9,,,,,,[£1), not 2 (%f]). The 
catenation commas are readily dismissed as mere aesthetic 
punctuation. When extreme efficiency is important, the labels should 
be catenated once, outside of any loops in which the multi-target 
branching is being employed: 


LABS€L1,L2,L3,L4,L5,L6,L7 


LOOP: 


oe td] 


>LABS( TYPE] 


>LOOP 
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PROBLEM: Construct looping logic which will allow the function 
PROCESS to be executed N times. The right argument of 
PROCESS is I, where I is the index number of the iteration 
Ci 25: Ogee eg hs 


TOPIC: Looping 


The simplest looping logic is: 


Tel 
LOOP: PROCESS I 
l¢eIt+l 

>2LOOP IF I<=N 


However, this logic breaks down when N=0. The check for completion 
is not made until after PROCESS has been executed at least once. A 
safer, but less simple, set of logic is: 


Te 

LOOP: >ENDLOOP IF I>N 
PROCESS I 

reli 

>LOOP 
ENDLOOP: 


Naturally, the conditional branch in both sets of logic above may be 
replaced by any valid form of conditional branching (discussed above). 


Notice the looping overhead which takes place within each iteration. 
In particular, the counter (I) is incremented, a comparison (>) is 
made and a conditional branch (*ENDLOOP IF ...) is performed. When 
extreme efficiency is important, some of this overhead can be removed 
from the looping logic by precalculating the branch labels: 


T¢1 

»+LAB¢ (Noe LOOP) , ENDLOOP 
LOOP: PROCESS I 

I¢I+1 

»LAB[I] 
ENDLOOP: 


Notice that this logic works correctly for the N=0 case. This 
looping logic is the most efficient possible. However, you should be 
careful when using it. The shortcoming of this approach is that you 
must have available workspace for the entire label vector. For 
example, if you plan to iterate 5000 times, you must have room for a 
5001 element integer label vector. 
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There are two other rather unconventional algorithms for looping 
which "loop" without branching back. One involves the use of execute 
Co): 


@(N2IT¢1)/LOOP¢’ PROCESS I Oo I¢I+1 9 @#CI=N)/LOOP’ 


(Note: © is an APL statement separator and is not available in all 
APL installations.) The other involves the use of a recursive 
function: 


LOOP N 
where the function LOOP is defined as: 


VY LOOP I 
[1] 2110 
[2] PROCESS 1+N-I 
[3] LOOP I-1 

V 


These two looping algorithms are confusing, inefficient and may cause 
unexpected complications (e.g. STACK FULL or WS FULL). 


Some extended APL systems which support nested arrays have a 
primitive "iterating" operator named "each" ("). The problem stated 
above can be solved via: 


PROCESS iN 


This expression is significantly simpler and more efficient than the 
sets of looping logic above. However, it has two drawbacks. The 
first is the same drawback which the precalculated label vector logic 
has, namely that you must have available workspace for the entire 
vector of counter (I) values. 


The second drawback is that this expression will not work (as is) if 
N=0. A DOMAIN ERROR or NONCE ERROR will result because the nested 
array system does not know what fill value (prototype) to associate 
with the empty result of PROCESS, should it have a result. For 
example, if the normal result of PROCESS is a character scalar for a 
numeric scalar argument, you would expect the result of PROCESS"'1O to 
be an empty character vector, not an empty numeric vector. The APL 
system has no way of knowing the nature of the result of PROCESS 
without executing it at least once. 


Viewing the "each" operator as an "iterator" rather than as a 


parallel processor can quickly lead to expressions which over-kill a 
problem. For example, consider this file-summarizing logic: 
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N€1000 

SUM¢O 

L¢1 
LOOP: SUM¢SUM++/READ I 
I¢eI+] 

>LOOP IF I=N 


This logic loops through 1000 components of an APL file, reading and 
summing the 5000-element numeric vectors found in each component. 
The equivalent nested arrays expression is: 


SUM¢+/+/° READ UN 


The expression is certainly concise. However, at one point during 
the execution of the expression, the contents of the entire 5,000,000 
element file exist in the workspace as a temporary nested variable. 
This problem can be circumvented by writing a function SUMREAD which 
returns the sum of the elements of a specified component: 


V R¢SUMREAD I 
[1] R¢+/READ I 
V 


Then, the expression can be rewritten as: 
SUM¢+/SUMREAD UN 


Now we have only the temporary 1000-element vector result of 
SUMREAD'.UN as extra baggage from the nested arrays approach. 


One of the recurring criticisms of APL is its lack of primitive 
looping constructs. Because of APL’s array-handling capabilities, 
looping is not required as often as in other programming languages. 
However, despite nested array extensions to APL, the need to loop 
still exists. 


Imagine a looping primitive (¢) which solves our PROCESS problem as 
follows: 


IT@ENDLOOP,N 
PROCESS I 
el 
ENDLOOP: 


The left argument of dyadic loop (@) or the right argument of 


monadic loop is the counter variable. The right argument of dyadic 
loop may contain from 1 to 4 elements: 
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[1] the line number (exit line) to which execution will proceed 
at the completion of the loop; 


[2] the number of iterations Cinfinite if omitted); 


[3] the value of the counter variable during the first iteration 
C(OIO if omitted); 


[4] the amount by which the counter variable is to be incremented 
or decremented after each iteration (1 if omitted). 


The monadic loop function increments the counter variable and 
branches back to the line immediately following the line containing 
the dyadic loop function if there are more iterations to perform. 
Otherwise, the counter variable is erased and the flow of execution 
proceeds to the exit line. 


APL utility functions can be written to approximate this behavior: 


>*LOOPI ENDLOOP,N 
PROCESS I 

®NEXTI 
ENDLOOP: 


The definitions of the LOOPI and NEXTI functions follow: 


CWSID: LOOP] 
V R¢€LOOPI LAB 
Initializes globals (I,loopi) for looping. 
LAB: line to branch to when loop complete, 
no. iterations, starting I, increment. 
Used in conjunction with NEXTI as: 


>*LOOPI END,100 


C7] PROCESS I 

C8] ~NEXTI 

[9] END: 

C10] 

[11] Default values of llright arg if omitted: 


rt 

OV 

Lt 
DSDDDDIDIIOIODIDIDIDD 


+infinity, O10, 1 

[13] ReLAB,LCe,LAB)JO,(L/10),0I10,1 

[14] a Exit if no iterations at all: 

[15] »C€RL1+0I01<1)0e0 

[16] TI¢eRC2+01I0] 

[17] A Top line, exit line, number of iterations, 
[18] A current I, increment, current counter: 
[19] R€eloopi¢(1+O0LC(£1+0I0]),R,1 
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C[WSID: LOOP] 

V RENEXTI;5010 
[1] a Used in conjunction with LOOPI. Returns line 
[2] aA number for next iteration of loop. Requires 
[3] aA (may erase) globals: I;loopi. 
C4] O10¢1 
[5] aA Increment I: 
C6] T¢loopil41¢loopil4j+loopil[5] 
C7] R¢loopif1] 
[8] aA Increment current counter; exit if not done: 
[9] >*Cloopil3]2loopil[6é]¢€loopil6]+1)e0 
[10] aA Else return exit line; erase I and loopi: 
[11] Reloopil2] 
[12] R¢R,OepDEX 'I loopi’ 

V 


Ps A As Na PR Po Pa NG Oe AN AG AN NG Bs A PG PNG Ba A AN Pa A Bs Re a Ns Oe 


PROBLEM: Suppose you want to compute the running balance of your 
Savings account for the last 24 months. You have made a 
Single deposit at the end of each month. DEPOSIT is a 25 
element vector of the opening balance and the 24 monthly 
deposits. RATE is a 24 element vector of the monthly 
interest rates during this time, expressed as fractions 
Ce.g. .0075 .0081 .0078 ...). You may compute the balances 
iteratively with the following formula: 


BALANCE[T+1] = DEPOSIT(CT+11] + CBALANCE[T] x CRATE(T1]+1)) 
where T goes from 2 to 25 and where BALANCE[1]=DEPOSIT[1]. 


What APL algorithm may be used to compute this stream of 
cash balances without looping? 


TOPIC: When to Loop in APL 


Because APL code is interpreted and not compiled, a looping algorithm 
is generally less efficient than a non-looping algorithm. For 
example, the expression SUM¢+/VECT will be significantly faster than 
the looping algorithm: 
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I¢SUM€O 
LOOP: >ENDLOOP IF I2eVECT 
T¢I+1 

SUM¢SUM+VECTI[T] 

>LOOP 
ENDLOOP: 


During each iteration of the loop, every symbol of code is 
reinterpreted. The addition function is actually a small portion of 
the processing being performed during the loop. 


This situation leads to two conclusions: 


1. Avoid looping in APL when possible; 


2. When looping is necessary, remove as much code as possible from 
within the loop. 


To illustrate, let us solve the above problem in a casual, looping 
fashion: 


CWSID: CASHBAL] 
V BALANCE¢RATE CASH1 DEPOSIT;I;N 
[1] aA Returns stream of cash balances for deposits 
[2] aA DEPOSIT and corresponding rates RATE. 
[3] NeoeDEPOSIT 
C4] BALANCE¢ (oDEPOSIT) 90 
C5] BALANCE[1)]¢DEPOSIT([1] 
C6] T¢e1 
C7] LOOP: 9END IF I2N 
C8] IeI+1 
[9] BALANCEL[LI1¢DEPOSIT(CIJ+BALANCE[CI-1]xRATE[CI-113+1 
[10] ~~»>LOOP 
C11] END: 
Vv 
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Now let's squeeze everything possible from the loop: 


[WSID: CASHBAL] 
V BALANCE¢RATE CASH2 DEPOSIT;B;1;LAB;N 
[1] aA Returns stream of cash balances for deposits 
[2] a DEPOSIT and corresponding rates RATE. 
C34] N¢«e DEPOSIT 
[4] BALANCE¢Noe0 
[5] BALANCEL[L1]¢B¢DEPOSIT(1) 
C6] RATECRATE+1 
C7] LAB¢ (Ne LOOP) , END 
C8] *LAB[I¢€2] 
[9] LOOP: B©EBALANCE[I]¢DEPOSIT([1I]+BxRATEL[I-1] 
C10] ~»LAB[I¢I+1] 
[11] END: 
V 


Given these modest modifications, we can expect the function CASH2 to 
take perhaps 60% to 70% as long to run as CASH1. 


Now let's look for a non-looping solution. Let’s refer to the 


elements of 1+RATE as R1, R2, R3,... Let’s refer to the elements of 
DEPOSIT as Dl, D2, D3,... Then the elements of BALANCE which we seek 
may be computed by the following expressions: 
D1 ; R1xD1 ; R1xXR2xD1 ‘ R1xR2xXR3xD1 P 
+D2 +R2xD2 +R2XR3xD2 
+D3 +R3xD3 
+D4 


Our objective is to find some APL expression which will generate this 
vector. We will accomplish this by performing a series of 
transformations to these elements until the resulting elements can be 
easily produced with an APL expression. We will then apply APL 
expressions which will reverse the transformations. 
Let's begin by defining the vector RSCAN: 
1 , RL , R1XR2 » RIXR2XR3 gh ise 
We will divide our desired result by RSCAN, giving: 
D2 D2 D3 D2 D3 D4 
Dl , Dil+t-- , Di1+--+----- D1+--+----- +--~-----=- 
R1 R1l R1xR2 R1 R1lXR2 R1xXR2XxR3 


Take the first difference (V[I+11]-V[I1) of these elements, giving: 
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Multiply the result by RSCAN, giving DEPOSIT: 
Di & ‘D2 » D3 » D4 e. ex 


Now, undo each transformation in reverse order. Undo the 
multiplication by RSCAN: 


DEPOSIT+RSCAN 


To undo the first difference, you must realize that the cumulative 
sum (+\V) is the inverse of the first difference (V-"110,V): 


+\DEPOSTT+RSCAN 
Undo the division by RSCAN: 
RSCANX+\DEPOSIT+RSCAN 
There it is. Expressed as a function: 


CWSID: CASHBAL] 

V BALANCE¢RATE CASH3 DEPOSIT; RSCAN 
[1] a Returns stream of cash balances for deposits 
C2] aA DEPOSIT and corresponding rates RATE. 
[3] aA Performs: 
[4] A BALANCE(LTJ¢DEPOSIT[LI]1+BALANCE[I-1]xXRATE[LI-1]+1 
[5] RSCAN¢ Ce DEPOSIT) 01,x\RATE+1 
C6] BALANCE¢RSCANX+\DEPOSIT+RSCAN 

V 


We can expect the function CASH3 to take perhaps 2% to 5% as long to 
run as CASH1! This significant improvement in speed does come at the 
cost of clarity. The algorithm in CASH3 screams out for comments, 
the least of which should be: 


A Performs: BALANCE[CTI]J¢DEPOSIT[C1I]+BALANCE([I-1]xRATE[I-11+1 


After seeing an elegant application of the APL scan functions to 
perform an inherently iterative function, it is easy to become 
obsessed with the pursuit of non-looping algorithms. Beware! You 
may invest a greater value of human time than is saved in machine 
time. As a further irony, you may find that your elegant and 
sophisticated non-looping algorithm is slower than a compact looping 
algorithm. 


As a guideline, do not spend your time looking for a non-looping 
algorithm unless all of the following are true: 

1. You suspect one exists; 

2. The function is used frequently and the looping algorithm is a 


"bottleneck" in the function; 


-18- 


Chapter 2 BRANCHING AND LOOPING 


3. The loop involves at least 20 iterations; 


4, Transplanting all possible logic from within the loop to 
outside the loop does not give you satisfactory performance. 


5. You do not have access to an APL "compiler". For example, STSC 
provides a product used in conjunction with its mainframe 
APL*PLUS System product which can be used to compile selected 
APL functions to improve their execution speed. The compiling 
process requires a good deal of both programmer and computer 
time but can produce dramatic efficiency improvements, 
especially on highly iterative functions. 


If you choose, or are forced, to employ a looping algorithm, you may 
Still solve the overall problem in an efficient manner by considering 
the context in which the loop is performed. To illustrate, let us 
consider the problem of computing the yield-to-maturity rates for 
1000 coupon-bearing bonds. 


Given the parameters which define the cash flows of a bond, it is 
necessary to solve for the yield by a method of successive 
approximations (looping). The APL solution to this problem is 
described in detail in the Financial Utilities chapter. For now, let 
us assume that we have an algorithm which can be used to determine 
the yield rate (to satisfactory precision) in no more than 10 
iterations. 


To compute the yield rates for the 1000 bonds, are we compelled to 
perform 10,000 iterations? 


No. We are forced to loop by successive approximation (10 
iterations) but we are not forced to loop by bond. To efficiently 
solve the problem, we may perform the 10 iterations on the parameters 
of all 1000 bonds at once. After 10 iterations, we will have the 
1000 desired yield rates. Computing yield rates by such an 
"iterative" APL approach is quite efficient. The processing speed 
will rival or surpass that of any compiled language. 
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PROBLEMS: 


1. 


2s 


oe 


a. 


BRANCHING AND LOOPING 


(Solutions on pages 324 to 326) 


What are the problems with the following conditional branch 
expression? 


»*CALCXCX>5)?1 


Assuming index origin 1, what expression will cause a branch to 


the line labeled NEGATIVE if N is negative, ZERO if N is zero or 
POSITIVE if N is positive? 


Assuming index origin 1, write the looping logic which will add 
together the 100 matrices in file components 11, 14, 17, ..., 
308. Assume the existence of the monadic function READ whose 
right argument is the number of the component to be read and 
whose explicit result is the matrix stored in that component 
(e.g. MAT¢READ 11). Use each of the following techniques: 


Normal APL looping logic Cincrement, compare, branch); 


b. Precalculated label vector logic; 


c. The hypothetical looping primitive (¢); 


da. 


e. 


The LOOPIL, NEXTI utility functions. 


The each ("') operator. 


Write non-looping APL logic which is equivalent to the 


following 
formula: 


OPRINC I] = OPRINCI-11]-CPMT-RATEXOPRIN[ I-11) 


for I from 1 to TERM, where OPRIN[OJ=LOAN. 
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5. Write a function CASH4 which uses another approach to perform the 


same task as that of functions CASH1, CASH2 and CASH3 listed in 
this chapter. Begin by defining a vector ACCUM: 


R1IxR2XR3X... » R2XR3xXR4xX... » R3xXR4XR5xX... : see ¢ at 
Perform the following transformations on the elements of BALANCE: 
A. Multiply by ACCUM 
B. Take the first difference 
C. Divide by ACCUM 


What is the result? Undo the transformations to construct the 
new algorithm and use it to write CASH4. 


The following function WRAPLP modifies its character vector right 
argument so that it will display in the width Cnumber of 
characters) specified in the left argument. The modification 
consists of inserting a newline (carriage return) character in 
place of the last blank character on each line. In that way, 
words (groups of contiguous nonblank characters) are not broken 
from one line to the next. Existing newline characters are left 
unaltered and are used to separate "sentences" within which the 
above word-wrap logic takes place. For example: 


oTEST 
70 
TEST 
THIS EXAMPLE IS NOT VERY BIG. 
THE FUNCTION WORKS ON LARGE VECTORS TOO. 
15 WRAPLP TEST 
THIS EXAMPLE 
IS NOT VERY 
BIG. 
THE FUNCTION 
WORKS ON LARGE 
VECTORS TOO. 


Rewrite the WRAPLP function to eliminate looping where possible. 
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CWSID: WP] 
V R¢WID WRAPLP CVEC;U10;BL; BREAK; 1I:L; LAST; LEN;LIM:NL;S; 

START; TCNL 
[1] aA Wraps text CVEC into lines of length WID 
[2] aA or less by inserting newline characters. 
[3] aA Origin 1: 
C4] OiLo0¢+1 
[5] aA Newline character: 
C6] TCNL¢OUTCNL A APL*PLUS 
[7] aA TCNL¢OTCL2) A APL2 
[8] A TCNL¢OAVEL157)] a SHARP APL 
[9] a Flag newline characters: 
C10] NL€CVEC=TCNL 
C11] A Index before start of each sentence: 
[12] START¢O,NL/toNL 
[13] A Lengths of sentences (between newlines): 
[14] LEN¢€7™1+C1llSTART,1+oCVEC)-START 
C15] A Flag valid break points (blank followed by nonblank): 
[16] BL€CVEC=' ! 
[17] BREAK¢BL>10BL 
[18] aA Initialize result from argument: 
C19] R¢CVEC 
[20] A Loop by sentence: 
[21] I1¢€0 
[22] LIM¢oLEN 
[23] LOOP1:>%( LIM<I¢1I+1)/0 
[24] L€¢LEN(TI] 
C25] S¢START([TI] 
[26] a Loop by line within sentence: 
C27] LOOP2:>(CL=WID)/LOOP1 
[28] A Find last break point within WID chars of line: 
C29] LAST¢+/vV\BREAK[S+®tWID] 
[30] a Advance start to new break point: 
[31] S¢S+LAST 
[32] A Insert newline: 
[33] R(SI¢TCNL 
C34] A Decrement remaining length: 
[35] L¢L-LAST 
[36] A Repeat: 
[37]  »LOOP2 

V 
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COMPUTER EFFICIENCY CONSIDERATIONS 


Time is money. The faster an APL function will run, the less it 
will cost. This is true whether you are using APL on a commercial 
remote timesharing service or on your dedicated personal computer. 

In this chapter, we discuss computer efficiency: measuring time 
consumption and understanding the factors which affect processing 
efficiency. 


BU AS BS ANG Ra A BA Pup A As Ps A NGS Pe A A Pe Be Pe Ps A A Pe Ao A A 


PROBLEM: Which expression will execute quicker on a 4000 element 
numeric vector V? 


1. R¢e+/VteV 
2. REC+/VI+eV 


TOPIC: Timing Alternative Algorithms 


The niladic APL system function OAI (Caccounting information) returns 
a numeric vector of miscellaneous usage statistics. The meanings of 
the elements of the result vary among the different implementations 
of APL. One element Cusually the second) measures the amount of 
processing time (CPU time) consumed since the current APL session 
began. It is usually expressed in milliseconds, or 60ths of a second 
or seconds. We will assume in our discussion that the index origin 
is 1 and that DAIC2] is the measure of processing time. 


Timing an algorithm is then a simple matter of checking the 
"stopwatch" before and after executing the algorithm: 


TIME1¢OAI(2] 
R¢(C+/V)+o0V 
TIME2<¢OAI(2] 
USED¢TIMEZ-TIMEL 
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These expressions should not be executed in immediate execution mode 
unless they are executed all at once on a single line: 


TIME1¢OAIC2] & R€EC+/VIFEV O TIME2OATLC2] & USED¢TIME2-TIME1 


If your implementation of APL does not have a statement separator 
(e.g. ©), the expressions should be specified as lines of a 
function. The reason for avoiding immediate execution mode is that 
CPU time is being consumed as you are typing each expression. In 
fact, on a dedicated (i.e. personal) computer, the measure of CPU 
time is equivalent to the measure of clock time. That is, the value 
for DAIC2] increases by 60 seconds every minute whether or not APL 
expressions are being executed. Therefore, the measured time will 
include typing time. 


Let us solve the problem above: 


V<400074000 

TeNAIL2] © Re+/V+pV © DAI(21-T 
675 

TeOAIL2] © Re(+/V)+eV © OAIL2]-T 
162 


From this example, we can begin to see the importance of well placed 
parentheses. In the first algorithm, the computer performs 4000 
divisions and 4000 additions. In the second, it performs 4000 
additions and 1 division. 


What happens if we time the algorithms again? 


TCOAICL2] & Re+/VieV © DAILC21-T 
692 

T¢+OATI£C2] & R¢eC+/VIFeEV O& DOAIC21-T 
LD 


We get the same approximate results but they are not exactly the 
same. Why? On a multi-user computer, the results will vary 
primarily because of the varying requirements of other users at the 
moment of execution. Even on a dedicated computer, the results may 
vary because of "house-cleaning" operations performed automatically 
and sporadically by the APL system and because of imprecise clock 
resolution. Therefore, if the accuracy of your timings is important, 
you should perform several timings and average the results. 
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PROBLEM: Write a dyadic function TIMER to time the execution of a 
specified algorithm. The algorithm is provided as an 
executable character vector right argument. TIMER runs the 
algorithm N times, where N is the left argument, and 
returns the average CPU time consumed. For example: 


25 TIMER 'R¢+/VipeV’ 
638.44 


TOPIC: A Utility Function for Timing Algorithms 


In writing the TIMER function, we will attempt to isolate the time 
consumed during the execution of the algorithm. Any time consumed 
during the overhead of the timing process itself will be deducted. 

In this way, and by averaging many samples, we can get timing results 
which are as precise as possible. 


There are two methods in APL for executing a character vector 
expression under program control. The first is to use the execute 
(@®) primitive and the second is to construct and execute a local 
function which has the expression as one of its lines. 


The first approach (¢) is simpler but not as accurate. When a 
character vector expression is executed, the expression must first be 
"parsed" so that the APL interpreter may correctly identify 
variables, APL primitive functions, character constants and numeric 
constants. It is during this parsing phase that variable names and 
function names are translated into pointers and addresses, the 
natural vocabulary of the computer. This parsing takes place when 
you enter an expression in immediate execution mode or in function 
definition mode or when you define a function under program control. 
Therefore, when you execute the expression as the line of a function, 
it has already been parsed and will execute quicker than if the 
expression is executed as the argument to the execute (¢) primitive. 


We will therefore use the second method, constructing and executing a 
local function, to time the specified algorithm. Our task is to 
define a function local to TIMER under program control which looks 
something like the following (say, to time R¢(+/V)+peV): 


V ELAPSED¢RUN1 N31 

C1] ELAPSED¢€OAI( 2] 

[2] I¢0 

C3] LOOP: >(N<I¢I+1)eEND 

[4] DOIT:R¢(+/VI+eV 

P53 >*LOOP 

[6] END: ELAPSED¢DAI[21-ELAPSED 
V 
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This function will run the specified algorithm (on line [41) N times, 
where N is the right argument (e.g. RUN1 25). It will return the 
amount of elapsed processing time consumed during its execution. 


We will define a second local function RUN2 which is identical to 
RUN1 except for line [4], which is defined to do nothing: 


[4] DOIT: 


We may then execute both RUN1 and RUN2 with the same argument, 
subtract the results (to eliminate the non-algorithm overhead) and 
divide by the number of iterations to determine the average 
processing time for a single execution of the specified algorithn. 
Therefore, TIMER will look something like: 


V REN TIMER CVEC;RUN1;RUN2 


: define RUN1 and RUN2 


C7] R¢CRUN1 N)-RUN2 N 
C8] R¢ER+N 
Vv 


How do we define RUN1 and RUN2? There are two popular methods for 
defining functions under program control. One is to build a 
character matrix which "looks" like the function, less the dels (V) 
and the bracketed line numbers. Each function line, including the 
header, occupies exactly one row of the character matrix. Each 
function line is padded with blanks to have as many characters as the 
longest function line. Such an array is called the "canonical 
representation" of the function and may be used as the definition of 
a new function in the workspace. The function is defined by a system 
function (OFX in APL2, ODEF in APL*PLUS or OFD in SHARP APL). 


The second method is to build a character vector which "looks" 
exactly like the function. The lines of the function are separated 
by the newline (i.e. carriage return) character. Such an array is 
called the "visual representation" of the function. The function is 
defined by a system function (ODEF in APL*PLUS or OFD in SHARP APL). 


There is one final comment to make before defining the TIMER 
function. Since the algorithm being timed may involve variables or 
functions having any valid names, it is possible that these 
identifiers may coincidentally be the same as the variables local to 
TIMER and RUN1. We should take some effort to name the local 
variables so that the chances of a name conflict are minimized. The 
RUN1 function we will construct will thus look like: 
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V AEACAFA ANASAIA 
C1) AEACOAI£L1+0I0) 
[2] ‘AIA€O 
[3] ALA: %CANA<ATACATA+1) 0AZA 
[4] ADA:R¢€(+/V)+eV 
[5] ALA 
[6] AZA: AEACDAICI+0I0O]-AEA 


Vv 


Let us define the TIMER function using the "visual representation" 
method. We will leave the "canonical representation" method as an 
exercise at the end of the chapter. We shall use the monadic ODEF 
system function to define the function. Its result is the character 
vector name of the function defined. The niladic system function 
OTCNL returns a character scalar newline character. 


CWSID: TIMING] 


V ARA©ANA TIMER ACA; AAA; ABA; AFA;AGA5ANLA 


ita — > Jolt te —— ch Se J 


Times the execution of the character vector 


C1] 
[2] 
[3] 
C4] 
C5) 
[6] Newline character: 

C7] ANLACUTCNL A APL*PLUS 

C8] A ANLA€UAV[156+0I0O] A SHARP APL 


ACA by running it ANA times. 


Re ep Vira Gene aly 


Returns a numeric 


scalar of the average CPU time consumed per run. 


Prepare to build local functions.. 


[9] AAAC!/VAEACAFA ANAS AIA’ ,ANLA,'CIIJAEACOAI(C1+0I10]' 


Se ae) ae ee oe a 


C10] AAACAAA , ANLA,’CL21ATACO’ ,ANLA 


ees eS se SS 8. 


[11] AAAeAAA,'[3]ALa: >(ANA<AIACATA+1) PAZA' 


C12] AAACAAA,ANLA,'LC41]ADA:' 


S44 SS Ss 


C13] ABAfCANLA,'[51>ALA’ ,ANLA 


C14] ABA€fABA, 'TE]AZA: AEACOUATC1+010) 
C15] A 

[16] A 

[17] A Define local fn AFA to run ACaA: 


[18] ARA©ODEF AAA,ACA,ABA A APL*PLUS 


-AEAV' 


[19] A ARA€3 OFD AAA, ACA,ABA A SHARP APL 


[20] 


D 


[21] A Define local fn AGA to run nothing: 


[22] AAATAAAL'F'’ ]€'G' 

[23] ARACODEF AAA,ABA A APL*PLUS 
[24] A ARA€3 OFD AAA,ABA A SHARP APL 
[25] 


me) 


[26] A Run the functions (disallow negative result): 


[27] ARACOlCAFA ANA)-~AGA ANA 
[28] A Return the average: 
[29] ARAfARA+ANA 


Bs Be PR OAs A Pe Pe Ps Os PN PR A Pe PN Pe Oe ue Oe BA BN ON Oe Bs Ps 
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PROBLEM: Define a procedure whereby the individual lines of a 
specified function may be timed so that possible 
inefficiencies can be quickly located within the function. 
The ideal end result of such a procedure will be a display 
like the following (for a 5 line function): 


TIMES TOTAL AVG MIN MAX 

LINE RUN CPU CPU CPU CPU 
1 3 450 150 122 171 

2 3 15 5 3 7 

3 153 918 6 3 8 

4 150 9,280 62 55 66 

5 3 1,065 395 240 380 

312 11,728 38 


TOPIC: Fine-tuning Production Applications for Efficiency 


After designing and implementing an application system in APL, you 
may find that it operates slower than you anticipated. In fact, the 
system may be so slow or so expensive that it is infeasible to 
operate. What can you do? 


A procedure such as the one suggested in the problem above allows you 
to examine the functions for bottlenecks. You begin with the highest 
level cover functions and work your way into suspicious 

subfunctions. Having identified the major inefficiencies, you are in 
an ideal position to correct them. A discussion of the causes and 
cures for some of the inefficiencies encountered in APL is contained 
later in this chapter. 


To aid in our discussion, suppose the function we wish to time is the 
following: 


V MODEL 
C1] SETUP 
C2] LIM¢50 © TI¢€0 
[3] LOOP:»END IF LIM<I¢I+1 
C4] PROCESS ¢ LOOP 
[5] END: CLOSE 
V 


To time the lines of this function, we need to click our "stopwatch" 
at the beginning and end of each line. This suggests the placement 
of timer functions at the start and end of each line. For example: 


C1] START © SETUP 6 STOP 
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This idea breaks down for lines which involve branches: 
{4] START © PROCESS 9° *LOOP Oo STOP 


In this example, the branch to the line labeled LOOP occurs before 
the STOP function is executed. 


Since functions placed at the end of function lines are not reliably 
executed (due to branching), we must be satisfied with placing timer 
functions only at the start of function lines. The timer function 
must then perform two tasks: to stop the stopwatch for the previous 
line (which may not be the line directly above the current line) and 
to start the stopwatch for the current line. If we call our timer 
function A, we may be able to time the MODEL function above by 
placing the timer function as follows: 


V MODEL 

© SETUP 

© LIM€50 © I€0 
OP:A © %END IF LIM<I¢I+1 
© PROCESS © LOOP 

D:A © CLOSE 


C1] 
[2] 
C3] 
[4] 
[5] 
C6] 


EF 


es 
I> 21> OF D> 


<j 


Notice that a new line must be added (line 6) to stop the stopwatch 
for the last line of the function (line 5). 


If your implementation of APL does not support statement separators 
(e.g. %), you are compelled to insert the timer function on the line 
before each function line: 


V MODEL 
C1] A 
C2] SETUP 
C3] A 
C4] LIM¢50 
C5] A 
C6] T€0 


[7] LOOP:A 
C8] SEND IF LIM<I¢1I+1 


[9] A 
C10] PROCESS 
C11] A 
[12]  »LOOP 
C13] END:A 
[14] CLOSE 
[15] A 

Vv 


-29- 


Chapter 3 COMPUTER EFFICIENCY CONSIDERATIONS 
It is the job of the timer function (A) to do the following: 


1. Record DAIL[2]. 


2. Look at a global variable (say AT) which contains the value of 
ODAICLC2] when A was last executed. Subtract this value from the 
value recorded in step 1. The result is the time consumed by 
the previously executed line. 


3. Look at a global variable (say AL) which contains the number of 
the line for which A was last executed. Using this line number 
and the consumption value computed in step 2, update the global 
variable accumulation matrix (say AM) which has one row per 
function line and 4 columns: times run, total CPU, minimum CPU, 
maximum CPU. 


4. Update AL to contain the number of the current line. 


5. Update AT to contain the value of DAI([2] at the start of the 
current line. 


To write the timer function (A), we will assume the following initial 
values for the required global variables: 


AL€O 
AM¢(N,4)90 0,(L/10),0 


No initial value is set for AT since A will not refer to it when it 
is first executed (on line 1) since AL is 0. The N used in the 
assignment of AM is the number of lines in the function being timed. 
The (L/10) is used to generate the largest possible number (the 
identity element for mimimum) for your APL system. We cannot 
initialize the minimum value to 0 since the 0O will remain as the 


minimum value. No timing result could be less. 


Let us write the timer function (assuming statement separators): 
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CWSID: TIMING] 


V A;TIME;USED;QO0I0 


C1] 
[2] 
C3] 
[4] 
C5] 
C6] 
C7] 
C8] 
C9] 
C10] 
C11] 
C12] 
C13] 
C14] 
C15] 
C16] 
C17] 
C18] 
[19] 


V 


TIME<OAI 

A Records time since last called and resets 
A stopwatch. Checks the 'stopwatch’ before 
A anything else. 

OLO¢+1 

A Branch if first time called: 

9(CxaL)JJL1 

A Compute time consumed since last called: 
USED¢TIME[21]-AT 

Aa Update accumulation matrix: 

AMCLAL; 1 2]¢AM[TAL; 1 2}3+1,USED 
AMCAL;3IJ¢AMCAL;3 ]LUSED 

AMT AL; 41]¢AMCAL;4ji°USED 

A Update line number: 

L1:AL¢OLC(2] 

A Set AL to 0 if bottom of function: 
ALCALXALS(0AM)[1] 

A Update ‘'stopwatch’ as last step: 
AT¢DATC2] 


Notice the use of OLC (line counter) to compute the number of the 
current line on which A is being called. If your implementation of 
APL does not support statement separators, and the number of lines in 
your function has been doubled because of the insertions of A, the 
computation of AL would be changed to: 


AL€(1+0LC021)+2 


Along with the A timer function, we need 3 other functions: 


TIMEADEFINE 


TIMEADISPLAY 


TIMEARESET 


'MODEL’ The TIMEADEFINE function modifies the 


function named in its character vector right 
argument by inserting the A timer function 
before each function line. It also initializes 
the global variables AL and AM. 


The niladic TIMEADISPLAY function generates and 
displays a formatted report of the contents of 
the global accumulation matrix AM. 


The niladic TIMEARESET function resets the 
global variables AL and AM to their initial 
(zeroed out) settings. Then, the function whose 
lines are being timed may be rerun and the 
results redisplayed. 
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The definition of the TIMEADEFINE function is fairly complex, 
especially if you attempt to write it without looping. Techniques 
for writing the function are discussed in a subsequent chapter 
(Boolean Techniques). The definition of TIMEADEFINE is a problem at 
the end of that chapter. 


The definition of the TIMEADISPLAY function is a straightforward 
formatting problem. 


CWSID: TIMING] 
V TIMEADISPLAY;A;B3;M;sQO10 
[1] aA Displays timing data stored in global arrays. 


[21 de’ TIMES TOTAL AVG MIN MAX! 
[3] Oe’ LINE RUN CPU CPU CPU CPU’ 
[4] De’ ---- ----- 0 nee -- 0 nen eeH-- +H He ' 


C5] OIl0¢+1 

[6] A Squeeze out rows of AM not updated: 
C7] MeAM[C CAME 311]4#0)/L1TCAM3 11 2 2 3 4) 
C8] A€¢M[ 32] 

[9] BeEME 33] 

[10] MCs31]<tpA 

C11] MCL341]¢BtA 

[12] A APL*PLUS, SHARP APL: 

C13] O¢'T4,CBI8,X1,4BCK318’' OFMT M 

C14] a APL2: 

[15] a O€C€’5550 555,559 ',320' 555,559')JSEMI 31 2],1000xXMLI;3 4 


[19] A APL*PLUS, SHARP APL: 

[20] O€¢'CBI12,X%1,2BCK31I8’' OFMT 1 3 eA,B,B+A 

C21] A APL2: 

C22] aA OC’ 555,559 ',16e’ 555,559')6A,B,BtA 
V 


The definition of the TIMEARESET function is trivial: 


C[WSID: TIMING] 
VY TIMEARESET 
Ci] AL€0 
C2] AM¢CeAaAM)e 0 0 ,CL/10),0 
V 


Some final notes on this line-timing procedure: 


1. Since TIMEADEFINE will permanently modify the function being 
timed, be sure to save a copy of the function before running 
TIMEADEFINE on it. 
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2. Functions timed by this procedure are subject to certain 
constraints: 


a. Avoid sudden exits (e.g. 20) and insure that the function 
always exits through its bottom (last line); 


b. Avoid branches to absolute or relative line numbers (e.g. 35 
or 20LC-1 or »NEXTL) if your implementation of APL does not 
support statement separators, since TIMEADEFINE will change 
the line numbers. 


The mainframe implementation of APL*PLUS provides a system function 
OMF (monitor facility) which enables you to time the individual lines 
of a function in much the same way as the utility functions above. 

If you use a mainframe APL*PLUS system, you should read the 
documentation to learn how to use OMF and its companion utility 
functions. 
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PROBLEM: What factors influence the efficiency of a user-defined APL 
function? 


TOPIC: APL Efficiency Considerations 


So far in this chapter we have discussed techniques for timing 
segments of APL code. Using these techniques, you can isolate 
inefficiencies in existing functions and you can choose between 
alternate algorithms. But how can you learn to write efficient APL 
functions in the first place? 


You can develop a feel for the efficiencies and inefficiencies of APL. 


If you were to go on a timing rampage and time every stitch of APL 
code in sight, some patterns would begin to emerge. You would begin 
to anticipate the relative speeds of algorithms without timing then. 
More important, you would find yourself formulating a mental model of 
the inner workings of the computer. Multiplication is more painful 
to the computer than addition and exponentiation more painful than 
multiplication. Out of sympathy for the machine, you will find 
yourself writing N+N instead of 2xN, and AxA instead of Ax2. 


While there is no substitute for such an encounter, below are some of 
the efficiency considerations which you may want to assimilate. 
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1. Addition (e.g. A+~2) is faster than subtraction (e.g. A-2) which 
is faster than multiplication which is faster than division which is 
faster than exponentiation which is faster than doing it by hand. 
For example, (x\No2) will typically be faster than (2*1N). 


2. Integers are easier (quicker) to manipulate than fractional 
(floating point) numbers. An array of integers will generally be 
stored internally in a more compact form (2 or 4 bytes per element) 
than an array of floating point numbers (8 bytes per element). 
Integers can be moved about (e.g. %, fT, [], /) quicker than floating 
point numbers and can be more easily operated on computationally 
(e.g. +, X, T, I). 


3. Boolean arrays (all Os and 1s) are stored as bits Cone-eighth byte 
per element) in many implementations of APL. On such 
implementations, operations involving Boolean arrays are either 
lightening fast or quicksand slow depending upon the implementation. 
If they are fast, it is because the bits are being manipulated one 
byte (8 bits) or so at a time or because the CPU is optimized for 
Boolean operations. If they are slow, it is because each bit has to 
be yanked from its byte and processed by a CPU which is better suited 
to working with 4, 8, 16, 32 or 64 bits at a time. Examples of 
functions so influenced include: A/, v\, +.A, /, #\. 


4. Elements of arrays are stored internally in raveled order. If you 
picture in your mind’s eye this internal "vector" representation of a 
matrix M, you should appreciate the reason that +/M is somewhat 
faster then +/4M. If M is a Boolean matrix (Cin which each element 
occupies one-eighth byte) the difference can be dramatic. 


5. Execute (¢@) is slower than branching because its argument must be 
parsed. For example: 


@(I>99)/'ML[15; ]<A+B’ 
is slower than: 


9(1T<99)eL1 
ML153;]¢€A+B 
Lié 


6. The workspace may be viewed as a chain of bytes. Ina clear 
workspace, all of the bytes are "clean" (unused). As you execute 
expressions (e.g. A€2 or Be1.50 or C*eA+B), the bytes become occupied 
by variables. When variables are reassigned (e.g. A€¢1+A), the old 
value of the variable is left as so much garbage. The same fate 
befalls temporary results which are the products of multiple 
expressions. For example, the expression A¢3+2xi5 produces the 
temporary results from t5 and from 2xi5. As you proceed, the 
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workspace becomes cluttered with a mixture of active variables and 
functions and garbage (unused) variables and functions. The symbol 
table is used to keep track of the locations of the active objects. 


Eventually, the computer will be asked to perform a function for 
which it cannot find sufficient clean space for the result. At that 
moment, the CPU will take a break from its APL function execution 
chores and will perform spring cleaning. All of the valid objects 
are shuffled back to the beginning of the workspace chain, the symbol 
table is updated and the remaining bytes are swept clean, ready to be 
reused. The CPU then resumes its APL function execution chores. 


It is because of these occasional workspace cleanups that you may 
notice seemingly random peaks when doing timings. Logic which 
requires a lot of temporary storage will tend to be less efficient 
than that which requires less storage. For example, index assignment 
tends to be more efficient than catenation reassignment: 


LOOP: VECLI]©<CRUNCH I 
VS. 
LOOP: VEC¢VEC,CRUNCH I 


Consider the storage requirements of the second expression when 
constructing VEC to be 1000 elements, one element at a time. For 
example, the catenate (,) function must find space for its 932 
element result while the 931 element VEC still exists. Of course, a 
moment later the 932 element vector is assigned the name VEC and the 
931 element vector is left to smolder in the ashes. On the other 
hand, the index assignment approach creates a 1000 element vector 
just once and then changes individual elements. Much less data 
shuffling is involved. 


7. Shape (e) and reshape (e) are the most primitive of primitives. 
The rank and shape of a variable are included as part of its internal 
representation. The shape function does not have to count its 
elements; it simply extracts the shape directly. Shape and reshape 
are extremely fast. For example, »BeL is faster than °B/L or BTL. 
Also, leeMAT is faster than I1ToeMAT. To construct a 100 by 100 
identity matrix (all zeros, but ones along the diagonal), the 
expression (1100)°.=1100 may seem simple enough to you but that’s 
because you do not have to perform the 10,000 mindless comparisons. 
The less intuitive expression 100 100e1,100e0 is dramatically faster 
because of its use of reshape. 


8. When performing scalar operations, time consumption can be 
measured with a ruler. Because APL is interpretive, it does not 
check for syntax errors, value errors or argument conformability 
until it executes the expression. When working with scalars, the 
time consumed making these checks tends to dwarf the time consumed 
performing the desired function. Therefore, when considering the 
efficiency of APL expressions dealing with scalars, it is more 
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pertinent to count the number of functions being executed than to 
dwell on the nature of the functions. Consider the expression: 


RCI-1LJ¢ROCI-11]+¢C1+TLI-1))+REI-11]xGlI-1])]*2 


If we count the functions being performed (counting index assignment 
as 2 functions), the result is 16. Let us rewrite the expression 
(assuming T1¢€1+T): 


weLeL 
RCJJ¢R(IIJ-TIiLIIJ+RCIIxXGlI]*2 


These expressions have the same effect as the original expression but 
involve 12 functions instead of 16. We can expect these expressions 
to run approximately 25% faster. 


Let us label the approximate time consumed when performing a function 
on scalars a "tad''. Then the original expression used 16 tads and 
the second used 12 tads. 


It is important to develop the proper perspective on tads. APL is an 
efficient language and performs tads extremely quickly. A tad is a 
miniscule unit of time. APL can perform a hundred tads ina blink of 
the eye. Tads do not become important until you write functions 
which consume thousands or tens of thousands of tads, i.e. when you 
loop. 


The lesson here is to avoid looping when APL’s array handling 
capabilities can be effectively employed. Your avoidance should not 
develop into a mania, however. Looping in APL is fast if there are 
not too many iterations (say, two dozen) or if the number of tads 
within the loop is not too large. Do what you can to keep the tads 
from getting into the tens or hundreds of thousands. 


9. Get to know the peculiarities of your APL implementation. For 
example, say you have a 100,000 element Boolean vector BV which 
contains only five 1s. Further, say you have only 1000 bytes of 
available workspace. Many implementations of APL will allow you to 
execute the expression I¢BV/leBV without producing a WS FULL error 
message. How can this be when tpBV results in an integer vector of 
100,000 elements (400,000 bytes or so)? 


In one set of implementations, the APL interpreter is clever enough 
to construe the two symbols /1 as a single function. Therefore, the 
monadic t is never executed. Instead, the /1i "function" scans its 
Boolean left argument for 1s and returns their indices. In these 
implementations, it is ironic to find a section of "optimized" code 
like the following: 


I<.teA 


IA¢A/TI 
IB¢B/T 
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The more conventional expressions will typically be much more 
efficient since they employ /l as a single function and do not 
require as much workspace: 


IACA/ieA 
IB¢B/1ioB 


In a second set of implementations, the result of monadic ul is an 
arithmetic progression vector and is stored internally as a "J 
vector". Specifically the computer stores only the vector's length, 
its starting value and its value-to-value increment. When executing 
the expression I¢BV/ieBV, the compression (/) function works with its 
"J" vector right argument without ever building the entire index 
vector. 


Expressions like 2x10+t11000 are extremely fast on APL systems which 
use J vectors. In this expression, a single addition and a single 
multiplication are performed to generate the 1000 element J vector 
result. 


If you do not know whether your APL system employs J vectors, try 
V¢lL1E9. If no WS FULL message is generated, you have J vectors. 


10. Use "compiled" functions when available. Some vendors of APL 
provide workspaces of utility functions which are written in machine 
code rather than in APL. These functions are extremely fast and 
behave like regular APL utility functions. They may be copied into 
or erased from your workspace and they can have arguments and 
results. Take some time to explore available workspaces of utility 
functions. 


In addition, much research has been conducted toward compiling APL 
code. For example, STSC provides a product used in conjunction with 
its mainframe APL*PLUS System product which can be used to compile 
selected APL functions to improve their execution speed. The 
compiling process requires a good deal of both programmer and 
computer time but can produce dramatic efficiency improvements when 
applied to bottleneck functions which consume a large portion of the 
processing time of an application systen. 


Some APL systems and some related software products allow you to run 
non-APL programs from within the APL environment. For example, if 
you have available a program written in another language (say, C or 
COBOL) which is very efficient and which performs a desired task, you 
may be able to invoke the program without ever leaving the APL 
workspace environment. 


A final caveat. This list of computer efficiency considerations can 
create a distorted perspective. Your primary goal as an APL 
programmer is not to write APL functions which run fast. Your goal 
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is to get the job done. If getting the job done means writing faster 
functions, then keep the above efficiency considerations in mind. 


In any case, you should also keep in mind these nonefficiency 
considerations: 


1. Your time is more valuable than the computer’s. If you find 
yourself laboring to find a faster algorithm, ask yourself, "Why?" 
Will the savings in computer time result in a more responsive system 
which is less frustrating to use and which saves people time? Will 
the efficiency improvements result in lower computer allocation 
charges or lower timesharing bills? If you cannot foresee material 
benefits from your efforts, you are wasting your time. 


In general, when writing a function which will perform a one-time 
task, forget efficiency. Use the code which flows most rapidly from 
your mind. Get the job done. 


2. A readable function is better than a fast one. It is a crime 
against nature to insert fast, obscure, uncommented code in a 
production application. Any algorithm which can be understood can be 
adequately commented. If you do not have the inclination to insert 
the comments, then do not use the code. By taking a moment to 
include comments with your efficient algorithm, you will write code 
which is both fast and readable. Remember, in six months the person 
who cannot understand your code may be you. 
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PROBLEMS: (Solutions on pages 327 to 329) 


1. Write a niladic function COST which will produce a display like 
the following: 


13.15 DOLLARS CONSUMED 
65.12 DOLLARS SINCE SIGNON 


The first line of the display will not appear the first time COST 
1s run and will thereafter display the dollars consumed since the 
prior execution of COST. Assume your CPU charge is 75 cents per 
unit of OAI(2]. 
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2. Write the TIMER function described in this chapter to work with 


the canonical representation method of local function definition. 


In the Sorting and Searching chapter, a function CMIOTA is 
presented for searching through the rows of one character matrix 
for the location of the rows of a second. The function is 
designed to use one of two different algorithms depending upon 
the number of rows in its arguments. Time CMIOTA (Cas defined in 
that chapter) for character matrix arguments of 10, 50, 100, 500 
and 1000 rows (12 columns) in all combinations (e.g. 50 row left 
argument and 100 row right argument). Then, change the line 


L2:>(F#1)0L4 
to 
L2:>CF#1)0L5 


and do all the timings again. The first set of timings uses a 
sorting algorithm and the second set uses a looping algorithm. 
Record these numbers. They are required by a problem in the 
Curve Fitting chapter which determines the constants to be 
plugged into CMIOTA for automatically choosing the fastest 
algorithm. 


Construct your character matrices such that the rows of your left 
argument are distinct (or nearly so) and the rows of your right 
argument are found throughout the left argument. For example: 


L¢50 12pUAVE?(050x12)0e256] (50 row left argument) 
R¢L[?100p1lpeeLI (100 row right argument) 
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POSITIONING CHARACTER DATA 


Many problems in APL involve the realignment of characters. For 
example, the title of a report may need to be centered above the body 
of the report; or a character vector entered by the user may need to 
have any extraneous blanks deleted from it. In this chapter, we 
discuss techniques for positioning the nonblank character elements of 
an array for a variety of different applications. 
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PROBLEM: Write the monadic functions DLB, DTB and DEB for deleting 
leading, trailing and extraneous blanks from a specified 
character vector. 


TOPIC: Removing Extra Blanks from Character Vectors 


The DLB, DTB and DEB functions are frequently used when accepting 
character input or when generating report output. For example, say 
you have a character matrix MONTHS of month names, left justified, an 
integer scalar MNO of the current month (1 to 12) and an integer 
scalar YR of the current year (e.g. 1987). You want to construct a 
character vector of the current month and year (e.g. 'JUNE 1987'). 
The following expression will perform the task: 


(DTB MONTHSEMNO;1]),’'’ ',¢YR 
Say you want to build a 30 column character matrix NAMES of employee 
names by prompting for one name at a time. You want each name to be 
left-justified in the matrix and to contain no extra spaces between 
the segments of the name. Use the following expression: 


NAMES¢NAMES ,C1LJ30TDEB,O 
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The following functions will perform the desired tasks. Notice that 
alternative algorithms are included in each function. The relative 
speed of each of the algorithms depends upon the implementation of 
APL you use. You may want to time them (as discussed in the Computer 
Efficiency Considerations chapter) to determine which is fastest for 
your APL environment. 


CWSID: FORMAT] 
V R¢DLB C 
[1] a Deletes leading blanks from character vector C. 
C2] ReCv\c#’ ')/C 


C3] A REC+/A\CH=' ')LC 
C4] Aa RECCCC#’ '311)-OI0OI1C 
V 
C[WSID: FORMAT] 
V R€EDTB C 


[1] a Deletes trailing blanks from character vec C. 
[2] ReCt+/v\' '#6C)eC 
C3] A R€ECOV\' '#O0C)/C 


C4) A R€EC-+/A\' '=OC)LC 
E5] aA Re COI0O-C’ '#OC)11IIC 
C6] aA R€EC1-CCH=’ ')11)1C 

V 


[CWSID: FORMAT] 
V R¢DEB C3;N 
[1] aA Deletes extraneous (leading, trailing, 
C2] aA contiguous) blanks from character vector C. 


C3] N¢C#!’ ' 

[4] R¢EC~1TNILONVIIN,O)/C 
C5] A 

Los. “A Cet "26 


C7] A Nec#’ ' 
[8] A ReE1lLCNV1ON)/C 
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PROBLEM: Write the monadic functions LJUST, CJUST and RJUST for 
left-justifying, centering and right-justifying the 
nonblank text within a specified character vector or matrix. 


TOPIC: Justifying Nonblank Segments within Character Arrays 


The LJUST, CJUST and RJUST functions are useful for constructing 
report titles and for merging character matrices. For example, to 
display ACME INC. centered within a width of 75 characters, use the 
following expression: 


CJUST 75T'ACME INC.’ 


As another illustration, say you have two 15 column character 
matrices of left-justified names, LNAMES and FNAMES. You would like 
to construct a 32 column character matrix of left-justified names in 
which the names of LNAMES precede the names of FNAMES and are 
separated by a comma and a single space (e.g. SMITH, JOHN). Use the 
following expression: 


LJUSTCRJUST LNAMES),’,’,’ ',FNAMES 
The following are the definitions of these functions: 


CWSID: FORMAT] 
V R€eLJUST C 
[1] R¢(C+/A\C=!' °')0C 
Vv 


[WSID: FORMAT] 
V R¢RIJUST C 
C1] R€EC+/V\' '#O0C)0C 


CWSID: FORMAT] 
V R¢CJUST C3B 
C1] BeCc=' ' 
[2] R¢ Cf CC+/A\B)-+/A\OB)42)0C 
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PROBLEM: 


TOPIC: 


POSITIONING CHARACTER DATA 


Given a 20 column character matrix of employee names (in 
alphabetical order), construct an 80 column matrix of the 
names such that the names run down the resulting matrix in 


4 "columns". 


The resulting matrix will have one-fourth (Cor 


so) aS many rows as the original matrix has. 


Restructuring Skinny Matrices into Fat Ones 


Let's illustrate this problem on a simple character matrix of first 


names. 


Ybeivd 


ANNE 
BILL 
CAL 
DOT 
ED 


FRED 
GAIL 
HAL 
IKE 
JOAN 


KEN 
LISA 
MIKE 
NED 
PAT 


RICK 
Vi 


In this illustration, the initial matrix has 7 columns instead of the 
specified 20 and the resulting matrix has 28 columns instead of 80. 
Still, you can see what we want to do. We will solve the problem for 
this simple 7 column matrix and then modify the solution to work for 
the specified 20 column matrix. 


The brute-force approach to this problem involves breaking the matrix 
apart into 4 pieces and then sticking them together side-by-side. 
Assume the name of the character matrix is CMAT. The number of rows 
in the desired result is computed as: 


NR¢I (1ToCMAT)+4 
NR is 5 in our illustration. 


The pieces can be extracted by using the take (T) and drop (J) 
functions: 


P1¢(NR, 7) TCMAT 
P2¢(NR,7)JTCNR,O)LCMAT 
P3¢(NR,7ITCC2XNR) ,0O) LCMAT 
P4¢(NR,7)TCC3XNR) ,O)LCMAT 
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In our illustration, P2 is: 


FRED 
GAIL 
HAL 
IKE 
JOAN 


Notice that the last expression pads P4 at the bottom with blank rows 
if there are fewer than 4xNR rows in CMAT. The last step catenates 
the 4 pieces together: 


R¢P1,P2,P3,P4 


A more elegant solution to this problem involves the use of dyadic 
transpose. We begin by padding CMAT so that its number of rows is 
divisible by 4: 


NReI (1TpCMAT)+4 
CMAT¢((4xNR),7)TCMAT 


Second, reshape the matrix into a 3 dimensional array: 
CMAT¢ (4,NR, 7) o0CMAT 

In our illustration, CMAT is now: 
ANNE 


BILL 
CAL 
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Notice that each of the planes in CMAT corresponds to one of the 
"columns" of names in the desired result. 
Third, use dyadic transpose to shuffle the planes and rows so that 
the shape changes from (4,NR,7) to (NR,4,7). Since the first 
coordinate (4) becomes the 2nd coordinate, the next (NR) becomes the 
Ist and the last (7) remains the 3rd, use 2 1 3 as the left argument 
Cor 10 2 in origin 0): 

CMAT<2 1 3&CMAT 
In our illustration, CMAT is now: 


ANNE 
FRED 


By performing this transpose, the characters of the array (if 
raveled) are in the same order as those in the desired result Cif 
raveled). 


Finally, reshape the array into the desired two-dimensional result: 
R¢€(NR,28)oCMAT 
The final solution for the 20 column problem is: 


NR¢f C1TeCMAT)+4 
R¢ECNR,80)e2 1 3804,NR,20)0(C4XNR) ,20) TCMAT 
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The dyadic transpose approach is generally more efficient than the 
brute-force approach. Its work is performed primarily by the 
relatively efficient reshape and transpose functions. The 
brute-force approach makes heavy use of the less efficient take, drop 
and catenate functions. 
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PROBLEM: Write a function TITLES which will return a character 
matrix of titles which will be displayed at the top of a 
report. The left argument is an integer scalar of the 
width of the resulting character matrix (i.e. the width of 
the report). The right argument is a delimited character 
vector (e.g. 'NOPERATING STATEMENTNDEC. 31, 1987N(C$000''S)') 
whose "partitions" each begin with one of the delimiters 
¢ (left-justify)., n (center) or > (right-justify). The 
result has one row per partition. Each partition is 
justified within the row according to the delimiter. For 
example: 


HDG¢50 TITLES’>PAGE 1NOPERATING STATEMENTN1987c($000s) ’ 


HDG 


PAGE 1 
OPERATING STATEMENT 
1987 
($000s) 


TOPIC: Delimited Character Vector to Justified Matrix 


Let us define the header of the TITLES function: 
VY R¢EWID TITLES CS 
We will use origin 0 throughout: 
OIO0¢O 
Determine which elements of CS are justification symbols: 


JUST¢€’cn>'iLCS 
BJIUST¢JUST<3 


BJUST is a Boolean vector with 1s corresponding to justification 
symbols. 
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JUST¢BIUST/JUST 
NROWS¢pJUST 


JUST is an integer vector with one element per justification symbol 
and whose values indicate which symbol (0: left; 1: center; 2: 
right). NROWS is the number of titles (delimiters). Determine the 
length (LEN) of each delimited partition (i.e. each title), excluding 
the justification symbol: 


T¢BUUST/tLeBIUST 
LEN¢C11T,oBJUST)-T+l1 


Reduce (truncate) those lengths which exceed the specified width WID: 
LEN¢WIDL LEN 


Determine the indices into CS of the characters which start each 
title (i.e. the character after the justification symbol): 


ARGSTART€1+T 


Determine the number of leading blanks required per title to justify 
it (left, center or right) within the matrix result: 


LEAD¢ (JUST#0)xL CWID-LEN)+1+JUST=1 


Determine the indices into the raveled matrix result of the 
characters which start each title: 


RESSTART¢ LEAD+WIDxXtLNROWS 


Initialize the result to have the correct number of characters but to 
be all-blank and raveled: 


R¢(NROWSXWID)J0’ ' 


All that remains is to extract the titles from CD (we know the 
starting positions, ARGSTART, and the lengths, LEN, of each title), 
insert them into R (we know the starting positions, RESSTART, and the 
lengths, LEN) and then reshape R to the proper shape. If there was 
but one title, we could do the following: 


RLRESSTART+1LLEN ]¢CSCLARGSTART+1ULEN ] 
Unfortunately, monadic t will only work with a one element argument. 
Imagine an enhanced monadic ut function which exhibits the following 


vector behavior: 


15 2 4 
01234031012 3 (remember: OIO0O=0) 
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Let us assume a function MONIOTA which will work with vectors as 
above. Then we can finish the function: 


RL (LEN/RESSTART)+MONIOTA LEN]<CS[ (LEN/ARGSTART)+MONIOTA LEN] 
R¢(NROWS ,WID) oR 


The definition of the MONIOTA function we need follows: 


C[WSID: UTILITY] 
VY R¢€MONIOTA LEN 
C1] aA Performs: (CLLENC1]),(0LLEN(23]),CtLEN(3]),... 
C2] a In APL2: Reéu LEN 
[3] R¢LEN/-~110,+\LEN 
C4] R¢R+ieR 
Vv 


As an exercise, you should reread the TITLES logic above to see what 
happens when some of the partitions are empty (e.g. 80 TITLES 
'NBALANCE SHEETNNNDEC. 31n’). 


Finally, let us redefine the TITLES function slightly to allow it to 
function as a typical character "vector to matrix" converter. Such a 
function takes a delimited character vector argument and converts it 
to a character matrix with one row per partition and with as few 
columns as possible (equal to the length of the longest partition). 
The rows of such a matrix result are usually left-justified (padded 
to the right). We will redefine the left argument of TITLES to be 
either the width of the resulting matrix or an empty vector if the 
width is to be automatically determined as the length of the longest 
partition. 


To implement this enhancement, we need only precede the line, 
LEN¢WIDLLEN, by the following: 


WID¢1TWID,f /LEN 
Now the TITLES function may be used in the following way: 


’' TITLES 'CREDCORANGECYELLOWCGREENCBLUE’ 
RED 
ORANGE 
YELLOW 
GREEN 
BLUE 


(The APL purist may prefer to express the empty vector left argument 
to TITLES as an empty numeric vector such as 00 rather then the 
empty character vector '’. In that way, the WID,I!/LEN operation does 
not engender a conceptual domain error from the catenation of 
character and numeric data. However, since most implementations of 
APL "forgive" the catenation of character and numeric datatypes when 
one of the arguments is empty, this preference is academic. ) 
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The completed TITLES function is listed below. 


C(WSID: FORMAT] 
V REWID TITLES CS;0IO;ARGSTART; BJUST;JUST; LEAD; LEN ;NROWS 
;RESSTART 5 T 
[1] aA Creates report titles from text CS within page 
[2] a width WID. CS is delimited by 'cn>’ indicating 
[3] a left, center, right justification respectively. 
C4] O10¢0 
[5] a O:left; l:center; 2:right; 3:not a delimiter: 
C6] BJUST¢3>JUST€! cn>' LCS€,CS 
(71 aA Select just delimiters; determine no. titles: 
C8] NROWS¢eJUST¢BIUST/JUST 
[9] aA Title lengths: 
[10] T¢BJUST/LPBIUST 
[11] LEN€(11T, pBJUST)-T+1 
[12] A Set WID as largest title length if empty WID 
[13] A provided; truncate titles to specified width: 
[14] LEN¢LENLWID¢1TWID,//LEN 
[15] A Index of char following each delimiter: 
[16] ARGSTART€1+T 
[171 A Leading blanks per title, to justify: 
[18] LEAD¢(JUST#0)xL CWID-LEN)+1+JUST=1 
[19] A Ind in raveled result where each segm. starts: 
[20] RESSTART¢<LEAD+WIDXUNROWS 
[21] A Blank, raveled result: 
[22] R¢(NROWSXWID)o’ ' 
C23] A T¢MONIOTA LEN: 
[24] T¢T+ Le T¢LEN/-~110,+\LEN 
[25] RIT+LEN/RESSTARTI]¢CS[T+LEN/ARGSTART] 
[26] R¢CNROWS ,WID)JoeR 
V 
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PROBLEMS: (Solutions on pages 330 to 333) 


1. Given a character vector TEXT which contains embedded newline 
characters (carriage returns) and given the scalar NL which is 
the newline character, what expression will return the first line 
of text Cup to, but not including, the first newline)? 
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2. Given a character vector CODE, find all occurrences of the string 
‘'/_.'. Return a bit vector which has the same length as CODE, 
with a 1 in each element which corresponds to the '/’ ina '/v' 
pair. All other elements are zero. 


3. Write a dyadic function CENTER which returns a character vector 
whose length is specified by the left argument and in which the 
character vector right argument is centered. For example: 


ef¢50 CENTER 'ACME’ 
ACME 
50 


Test your function on each of the following: 


50 CENTER 'ACME’ 
49 CENTER 'ACME’ 
3 CENTER ’ACME’ 
11 CENTER 'A' 


4. Suppose you have a dyadic function COLFMT which formats a numeric 
matrix into a character matrix. Its right argument NMAT is the 
numeric matrix to be formatted and its left argument CTL is an 
integer vector with one element per column of NMAT. The integers 
indicate the number of decimal places, for each numeric column, 
to be displayed in the character matrix result CMAT. Each number 
is formatted in a width of <width> characters (e.g. 10), where 
<width> is an integer scalar global variable. For example: 


ef¢3 0 1 COLFMT 4 39112 


1.000 2 3.0 
4.000 5 6.0 
7.000 8 9.0 
10.000 a Eel 12.0 


4 30 


Write a function ROWFMT which has the same syntax as COLFMT 
except the elements of its left argument correspond to the rows 
of the numeric matrix argument rather than to the columns. For 
example: 
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eH¢+3 0 1 2 ROWFMT 4 3e112 


1.000 2.000 3.000 
4 5 6 

7.0 8.0 9.0 
10.00 11.00 12.00 


4 30 


ROWFMT should use COLFMT. 


5. Write a dyadic function COLUMNIZE which will restructure a skinny 
matrix into a fat one as described in this chapter. The right 
argument of COLUMNIZE, CMAT, is the original skinny character 
matrix and the left argument is the number of "columns" of CMAT 
across the width of the fat character matrix result. For 
example, to solve the problem presented in that section, you 
would use: 


R¢4 COLUMNIZE CMAT 


Allow a 1 or 2 element left argument. If 2 elements, the first 
1s the number of rows per "page" and the second is the number of 
"columns" as discussed above. The result is a 3 dimensional 
character array with one plane per page. For example, using the 
7 column character matrix illustrated in this chapter: 


2 3 COLUMNIZE CMAT 


ANNE CAL ED 
BILL DOT FRED 
GAIL IKE KEN 
HAL JOAN LISA 
MIKE PAT VI 
NED RICK 


6. Write a function HEADINGS which will behave as described below: 


SYNTAX: CMAT¢WIDS HEADINGS CVEC 


DESCRIPTION: 


HEADINGS is used to convert a delimited character vector into a 

character matrix of column headings whose respective widths are 

given by the vector WIDS. Each substring of CVEC is preceded by 
a Gelimiter (n) and may contain any number of newline delimiters 
(¢). The newline delimiters therefore separate sub-substrings. 

Typically, one width (element of WIDS) is provided for each 
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heading (substring). However, if fewer widths are provided, they 
are repeated to match the number of headings in CVEC. The 
headings are formatted into a character matrix according to the 
following procedure: the sub-substrings of each heading are 
truncated if necessary to the corresponding width for that 
heading; the sub-substrings are padded to the left and right with 
spaces to bring each sub-substring up to the width for that 
heading; the sub-substrings are catenated together as rows 
(centered with respect to one another); a row of underlines 
(hyphens) is catenated to the bottom of each heading; the 
headings are padded on the top so that each heading has the same 
number of rows; the headings are catenated together separating 
them by 2 columns of blanks (if there are more elements in WID 
than there are headings defined by the right argument, the 
remaining elements are used as the numbers of columns of blanks 
to be inserted between each of the pairs of headings): 


10 13 8 HEADINGS 'nNNAMESNHIRE©DATENAGE¢ATCHIRE’ 


AGE 
HIRE AT 
NAMES DATE HIRE 


—e eee ee ee ee ee ee ee ee i 


10 13 8 4 1 HEADINGS 'nNAMESNHIRE*©DATENAGECAT¢HIRE ' 


AGE 
HIRE AT 
NAMES DATE HIRE 


—_— oe oo oe ee ee ee eee ee — a es ee eee eee —— oe eee eee 


Empty substrings in CVEC are displayed without underlines. To 
include an all-blank heading which is underlined, insert at least 
one blank character in the corresponding substring. 


~52- 


Chapter 5 


SORTING AND SEARCHING 


Many applications in the real world deal with lists of things. 
In APL those things are typically represented as numbers and the 
lists as vectors; or the things are represented as character vectors 
Crows) and the lists as matrices. That is, real world lists are 
usually represented in APL as numeric vectors or character matrices. 


Since the most common operations performed on lists include sorting, 
searching and selecting, these too are among the most important APL 
operations on vectors and matrices. In this chapter, we discuss 
primitive and utility APL functions for performing sorting and 
searching. In the next chapter, we discuss selecting. 
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PROBLEM: Suppose you have three 1000 element numeric vectors, ENUM 
Cemployee identification number), AGE Cemployee age) and 
OFFICE Coffice identification number), that these vectors 
are in one-to-one correspondence and that each element 
corresponds to a single employee. How can you reorder 
these three vectors such that they remain in one-to-one 
correspondence (i.e. the same index in each vector still 
corresponds to a single employee) but are sorted by office, 
and within office by age, and within age by employee number? 


TOPIC: Major-to-minor Sorting 


Sorting in APL is a two-step process: determine the "grade vector" 
of the vector to be sorted; and reorder the original vector by 
indexing the original vector with the grade vector. Therefore, to 
sort a vector SALARY in ascending order, you would employ the 
following expression: 


SORTEDSAL¢SALARY[ 4SALARY ] 
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The grade-up function (4) returns the grade vector and the indexing 
function ([1]) reorders the elements. Note that while we have "sorted 
SALARY", the variable SALARY remains unsorted (unless we reassign 

it: SALARY¢SALARY[A4SALARY]). 


Why does sorting require two steps in APL? Because the grade vector 
is required if we are dealing with several corresponding vectors 
whose elements must remain in one-to-one correspondence. For 
example, if we want to reorder ENUM, AGE and OFFICE such that the 
values of ENUM are in ascending order but still have corresponding 
elements in AGE and OFFICE, we must do the following: 


GRADE¢4ENUM 
ENUM¢CENUMLE GRADE J 
AGE¢AGE[ GRADE ] 
OFFICECOFFICEL GRADE] 


Note that the values of AGE and OFFICE are now not in ascending 
order. They have simply been reordered to continue to correspond to 
ENUM which is in ascending order. 


The sort required in the problem stated above is called a 
"major-to-minor" sort. It is not possible Cusually) to sort all 
three variables and to maintain the one-to-one correspondence. Only 
one variable can be strictly sorted (the "major" sort variable). The 
other variables can at best be sorted within each of the distinct 
values of the major variable, since only such reordering will 
maintain the sorted order of the major variable’s values. The second 
variable sorted is said to be "more minor" than the major variable. 
The third variable is more minor still and may be sorted only within 
each combination of the distinct values of the two more major sort 
variables. And so it goes. The last sort variable is called the 
"minor" sort variable. 


How do you do a major-to-minor sort in APL? Backwards. Reorder all 
the sort variables (to maintain correspondence) by sorting the minor 
sort variable. Then reorder them by the next more major variable. 
And so on. The last variable sorted will be the major sort variable 
and so it will be in strictly sorted order. Since sorting does not 
change the relative order of the values which are equal, the effects 
of the earlier sorts will be preserved within each of the distinct 
values of the major sort variable. 


The solution is therefore: 


GRADE¢4ENUM 
ENUM¢ENUM[E GRADE J 
AGECAGE[ GRADE] 
OFFICE¢OFFICEL GRADE ] 


GRADE¢4AGE 
ENUM¢CENUM[ GRADE J 
AGECAGEL GRADE] 
OFFICE¢COFFICEC GRADE J 
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GRADE¢40FFICE 
ENUM¢CENUM[E GRADE] 
AGE¢AGE[ GRADE J 
OFFICECOFFICEL GRADE J 


If you study this solution, it may strike you that there is much 
reordering (indexing) going on needlessly. In particular, rather 
than reorder every variable after each grade operation, you can just 
reorder the grade vector. Using this approach, the solution becomes: 


GRADE¢4ENUM 
GRADE¢GRADE[ 4AAGEC GRADE J J 
GRADE*¢GRADE[ 40FFICEL GRADE] J 


ENUM¢CENUMLE GRADE] 
AGE¢AGEL[ GRADE ] 
OFFICECOFFICEL[ GRADE ] 


The processing cost using this latter solution increases linearly as 
the number of sort variables increases. Using the former solution, 
the processing cost increases exponentially. However, the latter 
solution lacks the clarity of the first solution and requires 
comments. In fact, the latter solution is sufficiently unclear that 
many APL programmers feel little remorse at jamming the first three 
lines together using embedded assignment. That solution is included 
here so that you will recognize it, not as an endorsement: 


GRADE*¢GRADE[ 40FFICE[ GRADE¢GRADEL[ 4AGE[ GRADECAENUM] ))) 


ENUM¢ENUM[E GRADE] 
AGE©AGE[ GRADE ] 
OFFICECOFFICE[ GRADE] 


Some implementations of APL support numeric matrix right arguments to 
grade-up and grade-down. If so, the resulting grade vector is the 
result of grading the columns of the matrix (from left to right) as 
major-to-minor variables. If your APL implementation supports this 
feature, you may solve the above problem with the following 
expressions: 


GRADEC4OFFICE,AGE,C1.5]ENUM Cin origin 1) 
ENUM*CENUM[I GRADE J 


AGE¢CAGEL[ GRADE] 
OFFICECOFFICE[ GRADE ] 
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PROBLEM: Suppose you have a 1000 row, 8 column character matrix, 
CMAT, of part numbers, one per row (e.g. 'AK10632B’). How 
can you construct a 1000 element grade vector which can be 
used to reorder the rows of CMAT such that the part numbers 
are in ascending (alphabetic) order? 


TOPIC: Character Matrix Sorting 


In the previous problem, all sorting was performed on numbers. When 
numbers are sorted up, they are in ascending order. That means 
smaller numbers precede bigger numbers. In this problem, we will 
sort characters, not numbers. What does that mean? If the 
characters are letters of the alphabet, it means that the earlier (Cin 
the alphabet) letters precede the later letters. For characters not 
in the alphabet we must decide their relative sorting "magnitude" and 
extend the alphabet accordingly. Such an extended alphabet is called 
a "collating sequence" and is used as a reference to determine which 
characters are bigger or smaller than a given character for sorting 
purposes. The following character vector represents a typical 
collating sequence. 


CS¢' .,:3-/6123456789ABCDEFGHIJ KLMNOPORSTUVWXYZAD ' 


A different collating sequence may define a different result when 
sorting a given character array. Therefore the collating sequence is 
a necessary parameter to the solution of the problem. To solve the 
problem, we will define a dyadic function CGRADEUP whose left 
argument is the collating sequence, whose right argument is the 
character matrix to be sorted and whose result is the desired grade 
vector: 


VY GRADE¢CS CGRADEUP CMAT 


If the rows of a character matrix are in sorted order, what 
characteristics do they have? The characters of the first column are 
in strictly ascending order (as defined by the collating sequence). 
The characters of the second column are in strictly ascending order 
within any distinct character in the first column. The third column 
is sorted within distinct combinations of values in the first and 
second columns. And so on. In other words, the rows of the matrix 
are reordered by using the columns (first to last) as the 
major-to-minor sort keys. 


Some APL systems have defined primitive dyadic grade-up and 
grade-down to solve this problem directly. The left argument of A or 
Y is the collating sequence. There is no need to define a CGRADEUP 
function. The solution to this problem is: 


GRADE¢CS4CMAT 
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On APL systems for which dyadic grade-up and grade-down are not 
implemented, a different approach is required. The most 
straightforward converts the character matrix to an integer matrix 
with the same shape whose values are the indices into the collating 
sequence of the corresponding characters. Then the integer matrix is 
sorted in major-to-minor order as done in the previous section. 

Since dyadic t is used to convert characters to indices, the 
characters in the character matrix which are not in the collating 
sequence will translate to 1 greater than the length of the collating 
sequence Cor to the length of the collating sequence if the index 
Origin is 0). Therefore, all characters not included in the 
collating sequence are treated as if they are at the end of the 
collating sequence. 


CLWSID: SORT] 
V GRADE¢CS CGRADEUP1 CMAT;I 
[1] a Returns grade vector for sorting rows of 
[2] a CMAT with collating sequence CS. 
[3] a Convert characters to indices: 
[4] CMAT<CStCMAT 
[5] a Index of last column as a scalar: 
C6] I¢CeCMAT)(1+010)]-~DI0O 
[71 aA Return trivial result of no columns: 
[8] %CIZOIO)eL1 
C9] GRADE¢lleeCMAT 
C10J 790 
[11] A Grade rightmost (minor) column: 
[12] L1:GRADE¢CACMATLE s I] 
[13] A Decrement column index and exit if done: 
[14] L2:»(OIO>I€¢I-1)0e0 
[15] A Grade next more major column: 
[16] GRADE¢GRADE[ ACMATEGRADE;1I]] 
[171] »L2 
V 


A more sophisticated technique packs several columns together at once 
so that fewer applications of grade-up (4) are required. For 
example, suppose you have a 9 column character matrix whose indices 
into the collating sequence are the following: 


3.27 16 9 8 415 8 33 
31 30 19 910 8 24 2 23 
2 3 19 16 12 #4 19 14 15 


* 
* 


By grouping the matrix into 3 groups of 3 columns and by packing each 
group into 1 column by respectively multiplying its columns by 10000, 
100 and 1 and adding, the result is: 
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32716 90804 150833 
313019 91008 240223 
20319 161204 191415 


Because of the nature of major-to-minor sorting and because of the 
scheme used to pack these numbers, you can then determine the grade 
vector of this 3 column matrix (third column first) and it will be 
the same as that of the 9 column matrix. The approach will probably 
be more efficient than the original approach because only 3 grade-up 
operations are needed rather than the original 9. 


Taking this approach to its logical extreme, you may argue to pack 
all 9 columns into a single column (vector) of large numbers: 


32716090804150833 
313019091008240223 
20319161204191415 


* 
a 
e 


However, the computer internally maintains only 16 or 17 digits of 
precision on any number. It sees the numbers as: 


3271609080415083_ 
3130190910082402__ 
2031916120419141_ 


Therefore, the last digit or two of these large packed numbers are 
insignificant to the computer when it is grading the vector and may 
produce incorrect grade indices for rows of the character matrix 
which are identical except in the last column or two. 


So how many characters can be packed together at once? This is a 
function not only of the internal precision of your APL 
implementation but also of the length of the collating sequence. In 
the illustration above, the indices were packed by multiplying by 
consecutive powers of 100. Smaller powers (say 80) can be used to 
result in smaller packed numbers and to allow more columns to be 
packed at once. But if the powers used are too small, the indices 
will not always pack to distinct numbers. 


For example, if the power 10 is used to pack the numbers 3 2 4 and 3 
114, the results will be the same. This problem arises only if the 
range of indices is greater than the power used. Since the range of 
indices is one greater than the length of the collating sequence, 
that is the power you should use. 


The following solution packs as many columns at once and performs as 
few grade-up operations as possible. 
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[CWSID: SORT] 
V GRADE€¢CS CGRADEUP2 CMAT;1I;COLS;N;P 
C1] aA Returns grade vector for sorting rows of 
[2] a CMAT with collating sequence CS. 
[3] aA Convert characters to origin 0 indices: 
[4] CMAT€ CCSLCMAT ) -O10 
[5] aA Number of columns as a scalar: 
[6] I€CeCMAT){[1+0I0] 
[7] aA Return trivial result of no columns: 
[8] 2%CI>0)eL1 
[9]  GRADE+¢iloeCMAT 
C10] +30 
[11] A Compute max. no. cols. to pack (if 16 digits 
[12] A precision): 
[13] L1:COLS¢L (P€1+eCS)@1E16 
[14] A Number of cols. to pack for first grade: 
C15] N€ILCOLS 
[16] GRADECAPLAQCMATL 5 CI-NJ+iUN] 
[17] Aa Decrement columns and exit if done: 
[18] L2:>»(0Z2I¢I-N)oe0 
[19] a Grade next group of more major cols.: 
C20] N¢ItlLCoLs 
C21] GRADE¢GRADE[AP1&CMATE GRADE; CI-N)+iNJ] 
[22] L2 
V 


This solution is an improvement over the prior solution only if the 
packing operation is fast relative to the grade operation. The 
relative speeds differ among APL implementations and hardware 
configurations. You should time the two solutions for your 
implementation. Use the fastest, unless you are paid by the hour. 


If you are familiar with the issue of comparison tolerance (system 
variable OCT) , you are aware that APL systems typically do not 
distinguish between values which differ only beyond the 14th Cor so) 
Significant digit. Yet, here we are packing numbers out to 16 
Significant digits. We can do this because grade-up (4) and 
grade-down (¥Y) are primitive functions which do not consider 
comparison tolerance (as do =, >, t, €, #, etc.) If your 
implementation of grade-up and grade-down does consider comparison 
tolerance, you should modify the above function to localize OCT in 
the header and to set OCT¢0O (full precision) on the first line of the 
function. 


Finally, APL implementations store small integer numbers more 
compactly than large integer numbers. Because of these differences 
in internal storage, grade-up is faster on small integers than on 
large ones. This difference may be so dramatic that you should pack 
fewer columns and do more grade-up operations on the small integer 
values. If so, you should change the reference to 1E16 in the above 
function to 2147483647 or 32767 or whatever your largest integer is 
(i.e. the largest number not stored as an 8 byte floating point 
number). 
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When working with character matrices which are wide and which have 
rows whose values are all significantly different (e.g. names), 
another solution to this problem becomes practical. The approach is 
to work with the columns in major-to-minor order. 


Sort the first column. Compare the sorted characters to their 


neighbor (prior and next row) characters. If both neighbor 
characters are different, the row is distinct and belongs in its 
current (sorted) position. If one or both of its neighbors have the 


same value as it has, we must proceed to the second column. Consider 
the second column for only the rows whose value is not distinct for 
the first column. Sort the second column within the values of the 
first column. Compare the sorted characters to its neighbors and 
again identify the rows which are still not distinct for the first 
two columns. And so on. 


Consider each successive column until all rows are known to be 
distinct or until you run out of columns. As fewer and fewer 
nondistinct rows remain, the grade-up operation will be performed on 
shorter and shorter vectors. Since the grade-up operation is quicker 
on short vectors than on long ones, this solution can be quite fast 
on matrices whose row values are mostly different. 


[WSID: SORT] 
V GRADE¢€CS CGRADEUP CMAT;C3F;3G;IsMsN;R; ROWS 
C1] aA Returns grade vector for sorting rows of 
[2] aA CMAT with collating sequence CS. 
[3] a Index of last column as a scalar: 
[4] N¢( oCMAT)(1+01I101]-~01I0 
[5] aA Return trivial result of no columns: 
[6] >(N20I0) eL1 
C7] GRADE¢tloeCMAT 
[8] >0 
[9] a Select first column: 
[10] L1:CeCMATE ;I<0I0) 
[11] A Convert characters to indices and grade them: 
[12] GRADE¢4CSLC 
C13] A Exit if 1 column or 1 or less rows: 
C14] %CCI=N) V12loeeCMAT) 00 
[15] A Sort characters: 
[16] C¢CC[GRADE] 
[17] a Flag first of groups of equal values: 
[18] F¢ec#™10C 
[19] A Handle incorrect result if all values equal: 
[20] FCOTOJ¢1 
[21] a Flag values still unresolved (i.e. more than 
C22] A 1 equal value): 
(23] M¢FA1OF 
[24] a Squeeze down flag-first vector: 
[25] F¢M/F 
[26] a Exit if none left to resolve: 
[27] 7»CeF)L0O 
[28] A Indices into GRADE of unresolved values: 
[293 ROWS*M/iL0eM 
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[30] 
C31] 
C32] 
[33] 
[34] 
[35] 
[36] 
C37] 
C38] 
[39] 
[40] 
[41] 
[42] 
C43] 
[44] 
[45] 
[46] 
[47] 
C48] 
[49] 
C50] 
C51] 
[52] 
[53] 
[54] 
C55] 
C56] 


VY CGRADEUP (Ccontinued) 
A Indices into CMAT of unresolved values: 


V 


LOOP: R©GRADEL ROWS ] 


A Increment column index: 


T¢I+1 


SORTING AND 


A Select Ith column for unresolved rows: 


C¢CMATER;T] 


A Convert and grade characters: 


GEaACSrc 


A Reorder grade vec to maintain 


G¢GLAC+\FICGI] 


A Insert reordered grade vec: 


GRADEL ROWS J¢R[G] 


Aa Exit if no more columns: 


>CI2=N)0e0 
A Sort characters: 
CeC[G] 


A Flag first, considering prior 


FEFVC#-10C 


Aa Flag values still unresolved: 


M¢FA1OF 

A Squeeze down 
FeM/F 

Aa Exit if none 
®CoFILO 

A Squeeze down 
ROWS¢M/ROWS 
+LOOP 


Aur PS Oe A Pus se ND Pe Ps A 


left to resolve: 


sorted prior 


columns too: 


flag-first vector: 


unresolved indices into GRADE: 


INS A A A Pus Pe Pe Ps As PRD Pus Ps PN Pus Ps A 


Sort the following character matrix, SUBJECTS. 


Lincoln 
troops 
liberty 
lasting 
brothers 
Grant 
Lee 
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Chapter 5 SORTING AND SEARCHING 
TOPIC: Uppercase/Lowercase Sorting 


Since this matrix contains both uppercase and lowercase letters, the 
collating sequence must contain both the uppercase and lowercase 
alphabets. Let’s try catenating then: 


CcS¢’ ABCDEF...XYZabcdef...xyz’ 
SUBJECTS[CSASUBJECTS ;$ ] Cuse CGRADEUP if 
Grant dyadic A is unavailable) 
Lee 
Lincoln 
brothers 
lasting 
liberty 
troops 


No good. Words beginning with the letter L should be together, 
whether the L is uppercase or lowercase. Let’s try interleaving the 
uppercase and lowercase alphabets: 


CS¢’ AaBbCcDcEeFf...XxYy2Zz' 
SUBJECTS[CS4SUBJECTS; ] 

brothers 

Grant 

Lee 

Lincoln 

lasting 

liberty 

troops 


Not guite. Although words beginning with the letter L are now 
together, those beginning with an uppercase L precede those beginning 
with a lowercase 1, regardless of the second letter in each word. We 
want all Ls to be treated equally, regardless of case. 


Since equality is our aim, let us promote each lowercase letter to an 
uppercase letter and try again. Suppose UPPERCASE is a monadic 
function which converts its character array argument to an array of 
the same shape and values except each lowercase letter has been 
replaced by the corresponding uppercase letter. Then, lowercase 
letters can be omitted from the collating sequence. The following 
solution does the job. 


CS¢€' ABCDEF...XYZ' 
SUBJECTS[CSAUPPERCASE SUBJECTS; ] 

brothers 

Grant 

lasting 

Lee 

liberty 

Lincoln 

troops 
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Notice that the character matrix is converted to uppercase letters 
for purposes of grading only. The original mixed case matrix is used 
for indexing. 


The technique of converting an array to uppercase letters is also 
useful for searching through mixed case arrays whenever the uppercase 
and lowercase characteristics of letters are to be ignored. For 
example, to list the words which begin with "LI": 


(CUPPERCASE SUBJECTS[3;1 2])A.='LI' J) 4SUBJECTS 
Lincoln 
liberty 


The following function will perform the desired translation to 
uppercase letters. 


CWSID: SORT] 

V R€EUPPERCASE C;FOUND; IND; LOWER;UPPER 
C1] a Converts the lowercase letters in the character 
[2] aA array C into the corresponding uppercase 
[3] a letters. Useful for sorting or searching 
[4] a character arrays when the case distinction is 
[5] a to be ignored. 
[6] LOWER¢ 'abcdefghijklmnopgrstuvwxyz '’' 
C7] UPPER¢ 'ABCDEFGHIJ KLMNOPORSTUVWXYZ'’' 
C8] R¢,C 
[9] a Inds of arg in LOWER (1+last ind if not found): 
C101] IND<LOWERLR 
C11] A Mark those found: 
[12] FOUND¢IND<OI0+eLOWER 
[13] A Insert UPPER elements in place of LOWER ones: 
C14] RCFOUND/LeFOUNDI]<UPPER[ FOUND/IND] 
C15] aA Reshape to original shape: 
[16] R€eCeC)oeR 


C17] A 
[18] A In APL2, no need to reshape: 
[19] A IND¢LOWERtu ,C 
C20] A FOUND¢IND<0OI0+o LOWER 
C211] A R€C 
[22] A (FOUND/ ,R)©UPPER[ FOUND/IND] 
[23] A 
[24] A Alternate algorithm... 
[25] A Construct OAV of only uppercase letters: 
[26] A AAV<DAV 
[27] A AAVCOAVLLOWERJ]<UPPER 
[28] A Perform transl from lower/upper OAV to upper AAV: 
[29] A R¢AAVLOAVLC]I 
V 


Implementations of APL which provide dyadic grade-up typically also 
provide a facility for handling this uppercase/lowercase problem 
directly. Specifically, the collating sequence left argument may be 


-63- 


Chapter 5 SORTING AND SEARCHING 


a matrix which contains both alphabets as two corresponding rows. 
For example: 


CS¢' ABCDEF...XYZ’',[0.51]' abcdef...xyz’ 
SUBJECTSL[LCSASUBJECTS; ] 

brothers 

Grant 

lasting 

Lee 

liberty 

Lincoln 

troops 


You should read your documentation for dyadic grade-up to understand 
the subtleties of this facility. Given the facility, the UPPERCASE 
function is not needed for sorting uppercase/lowercase character 
matrices. However, it is still useful for searching through 
uppercase/lowercase arrays. 


Pe PR AG Aus Pa Ps PD Pa NG ANS NS Pe ONG PN Pus Bs A Boe AN NS ON Bus Os PANG A 


PROBLEM: Given the policy numbers of 1000 existing policyholders (as 
a 1000 row, 12 column character matrix since the policy 
"numbers" may contain letters) and the policy numbers of 
600 non-smokers (600 rows, 12 columns), determine the index 
of the existing policy (1 to 1000) to which each of the 600 
non-smokers corresponds. The resulting integer vector will 
have 600 elements. Return the "index'' 1001 if the policy 
number is not found. 


TOPIC: Array Searching 


If the two lists of policy numbers were numeric vectors rather than 
character matrices, the solution would be trivial. Suppose BASE is 
the name of the 1000 element vector of existing policy numbers, VALS 
is the name of the 600 element vector of non-smoker policy numbers 
and INDS is the desired result. The following expression will solve 
the problem: 


INDS©BASEtLVALS 
(Note that the default comparison tolerance, i.e. OCT, will need to 


be reduced to make accurate comparisons between numbers with more 
than 14 or so digits.) 
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In APL implementations which support nested arrays, the dyadic tu 
function may be used to solve this problem, even on character 
matrices. The first step is to convert the matrix arguments to 
nested vector arguments (e.g. c{2]BASE in APL2). The solution then 
follows directly: 


INDS¢€(cl2]BASE)iLCCL21VALS ) Cin APL2) 
INDS¢€( L021 BASE)ILCLC2IVALS ) ‘Cin APL*PLUS) 
INDS<«(<eol1 BASE)ULC<9o1 VALS) Cin SHARP APL) 


In APL implementations which do not support nested arrays, a more 
creative approach is required since the dyadic t function does not 
operate as hoped for on character matrix lists. 


One effective approach is to convert the character matrix arguments 
into numeric vector arguments by converting the columns of characters 
into columns of indices and then packing the numbers together by the 
techniques of the previous sections. As mentioned there, only a 
limited number of characters may be packed into a single number 
without losing precision (say 8 to 12 character columns, depending 
upon the length of the character vector collating sequence used to 
convert the characters to indices). Further, since dyadic 1 uses 
comparison tolerance, the value of OCT should be set to zero to make 
comparisons which are as precise as possible. 


The following function uses this technique to emulate dyadic 1u on 
character matrices: 


C[WSID: SEARCH] 
V INDS¢BASE CMIOTA1 VALS;CS;P;0CT 
[1] aA Returns the row indices of BASE at which the 
[2] aA rows of VALS first match. 
[31 aA Set comparison tolerance to maximum precision: 
C4] OcT<0 
C5] a Determine collating sequence: 
C6] CS¢( (OAVE BASE) VOAVEVALS ) /OAV 
C7] aA Packing factor: 
C8] P€1+oCS 
[9] aA Pack and search: 
[10] INDS¢€CPLCCSL&BASE)-OL0O) LPs CCSt&VALS )-OIO 
V 


Unfortunately, this technique will not work on wide character 
matrices. Further, in some APL implementations, the decode (1) 
function is slow. Under either of these conditions, another approach 
is desired. 


Suppose BASE is a numeric vector and VAL is a numeric scalar. How 
can we find the index of the first occurrence of VAL in BASE? 


BASEUtVAL 
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How can we identify (by bits) all of the occurrences of VAL in BASE? 
BASE=VAL 


If VALS is a vector, how can we identify (by bits) all of the 
occurrences of VALS in BASE? 


BASEo.=VALS 


If BASE is a character matrix and VAL is a character vector (i.e. one 
policy number), how can we identify (by bits) all of the occurrences 
of VAL in BASE? 


BASEA. =VAL 


If VALS is also a character matrix, how can we identify (by bits) all 
of the occurrences of VALS in BASE? 


BASEA. =QVALS 


The information we seek is contained in the Boolean matrix result of 
this expression. Specifically, the column index of the first bit in 
each row is the index we seek. By using a Boolean scan, we can 
extract the indices: 


INDS¢OI0++/A\~BASEA. =QVALS 


We will modify this algorithm somewhat to replace row-wise (e.g. A) 
functions by the usually faster column-wise (e.g. A\) functions and 
to eliminate the not (~) function. 


[CWSID: SEARCH] 
VY INDS¢<BASE CMIOTA2 VALS 
C1] a Returns the row indices of BASE at which the 
[2] aA rows of VALS first match. 
{3] INDS¢€O1I0++/A\VALSY. #QBASE 
V 


This algorithm is an excellent illustration of the power of APL. 
Unfortunately, it has some drawbacks. For this example, the result 
of the v.# function is a 600 row, 1000 column Boolean matrix (600,000 
elements) which might generate a WS FULL error message. Even if it 
does work, the function will consume a large amount of CPU time while 
making the 600,000 comparisons. 


A different approach takes advantage of the speed of sorting 
algorithms. The following discussion assumes OI0O=1. 


1. Combine the two arguments via catenation into a 1600 row matrix: 


A¢BASE,C1JVALS 
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2. Sort the combined matrix using the CGRADEUP function developed in 
a previous section (or using dyadic 4 if available) and using OAV as 
the collating sequence: 


GRADECOAV CGRADEUP A 
A©A( GRADE; ] 


By sorting the matrix, like rows are now contiguous. 
3. Shift the rows of the matrix down one row and compare: 
FLAG¢V/A# 160A 


FLAG is a 1600 element Boolean vector whose 1s flag the first of each 
set of contiguous like-valued rows. 


4. For each of the 1600 rows, determine the index into the original 
unsorted catenated matrix of the first row of each set of contiguous 
like-valued rows: 


FIRST¢ (FLAG/GRADE)[+\FLAG] 


The 1600 elements of FIRST correspond to the rows of the sorted 
catenated matrix. 


5. Reorder the elements so they correspond to the rows of the 
unsorted catenated matrix: 


INDS€( pFIRST) 00 
INDS[GRADE1]1¢FIRST 


6. Select only those elements of INDS which correspond to the rows of 
VALS Cnot the rows of BASE): 


L¢1ToBASE 
INDS¢LLINDS 


7. Set the elements of INDS which correspond to rows of VALS for 
which no matching row was found in BASE to the "not found" index (1 
plus the number of rows in BASE): 


INDS¢INDSL1+L 


This approach is quite efficient for large arguments. However, 
because it requires so many steps, other algorithms may be more 
efficient for small arguments. In particular, inner product (A.=) is 
typically quite fast when one argument is a matrix and the other is a 
vector. Therefore for a small (few rows) right argument of CMIOTA, 
it may actually be faster to loop on the rows of the right argument 
(using A.= to search through the rows of the matrix left argument) 
than to employ this catenating, sorting, shifting, comparing 
algorithm. 
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But how small should the right argument to CMIOTA be before we switch 
to a looping algorithm? Let us assume the arguments to CMIOTA are L 
and R: 


I¢L CMIOTA R 


The CPU time consumed by the looping algorithm increases linearly 
with the number of rows in R (for a constant L) and linearly with the 
number of rows in L (for a constant R). Therefore, the CPU time 
consumed will be a function of the formula: 


CPUL = C1+(CRRxX(C2+(C3xXRL) )) 


where CPUL is the amount of CPU time consumed by the looping 
algorithm, RR and RL are the number of rows in R and L respectively, 
and C1, C2 and C3 are constants to be determined. 


The CPU time consumed by the sorting algorithm increases linearly 
with the sum of the numbers of rows in Rand in LL. Therefore, the 
CPU time consumed will be a function of the formula: 


CPUS = C4+(C5xCRR+RL) ) 


where CPUS is the amount of CPU time consumed by the sorting 
algorithm and C4 and C5 are constants to be determined. 


The values of Cl, C2, C3, C4 and C5 for the formulas above will 
depend upon the particular machine and APL implementation. To 
determine them for your environment, you must time the two algorithms 
for a variety of arguments and then use the techniques of least 
squares to find the constants which define the "best" curves to fit 
the empirical data. There is a problem at the end of the chapter on 
Computer Efficiency Considerations which performs the first task and 
a problem at the end of the chapter on Curve Fitting which performs 
the second task. Work these problems and plug the derived values 
into the CMIOTA function below (in place of Cl, C2, C3, C4, C5). 


(The formulas above do not consider the number of columns in the 
matrix arguments nor the nature of the data, i.e. whether and where 
the values are found. Therefore they are not precise formulas. 
However, they will be sufficiently accurate to insure that the best 
algorithm is used in all but borderline cases. ) 


The following CMIOTA function uses the approaches discussed above and 


has been extended to handle origin 0 and to treat the trivial cases 
Cempty or l-row arguments) separately. 
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C[WSID: SEARCH] 


V INDS¢BASE CMIOTA VALS;A;F;G;1;1L 


a Returns the row indices of BASE at which the 


A rows of VALS first match. 

a Branch if right arg a matrix: 
>(C2=opeVALS)oL1 

A Handle vec or scalar right arg: 
INDS¢( BASEA. =VALS) 11 
>0 

L1: L€¢(oBASE){OI01 
A¢( eVALS) [O10] 

Aa Branch unless no rows in either arg: 
>(xF¢ALL) oL2 

A Handle empty arg: 
INDS€AeOIO 
>0 

Aa Branch if both args have more than 1 row: 

L2:9(F¥1)eL4 

A Branch unless left arg has 1 row: 
+(L#1)eL3 

A Handle 1 row left arg: 
INDS¢0I0+VALSY.#, BASE 
>0 

A Handle 1 row right arg: 

L3: INDS¢, (BASEA.=,VALS)11 
>0 

A Branch if sort alg. 


costs more than looping alg.: 


A (remove a after replacing C1,C2,C3,C4 by 

A computed constants): 

L4: A%((C4+C5xXL+A)>C1+AxC2+C3xL) eo L5 

A Combine args. and sort (like values together) 


A Cuse CGRADEUP if dyadic 4 unavailable): 
G¢HUAVAACBASE, COIOJVALS 
AcAl[G; ] 

Aa Flag lst of distinct rows by shifting and comparing: 


Fev/A# 100A 

A Insure lst elt is 1 Cin case all rows the 
FCOIOI]¢1 

a Indices of 1st distinct rows: 

I¢F/G 

A Replicate for each like row: 

FLCOTOJ¢OIO 

T¢1L(C+\F] 
A Unsort 
INDS¢I 
INDS(GIJ¢I 

A Keep those corresponding to right arg: 


indices (to catenated order): 


INDS¢€¢LLINDS 

A Set ‘not found’ inds to ‘one greater’: 
INDS¢€INDSLL+UO1I0 

>0 
Aa Use looping algorithm if more efficient: 


L5: INDS<Ap0 
L¢(AoeL6) ,0 
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VY CMIOTA (continued) 
C53] L<O0IO 
[54] L6:INDSCLI1<CBASEA.=VALS[I;])11 
[55] 2»LET¢et+1]I 

V 


Pa BA A As PG BR MB A Pe As Bo Pua Pa NG ANG Pr OS PN Pu PN As Pre NS NGS NG 


PROBLEM: Suppose you have a 1000 element vector of ages. You wish 
to group the ages into the 10 ranges: 0 to 9, 10 to 19, 20 
to 24, 25 to 29, 30 to 34, 35 to 39, 40 to 49, 50 to 59, 60 
to 64, 65 and up. What approach would you take to translate 
these 1000 ages into 1000 corresponding range indices (i.e. 
numbers between 1 and 10)? 


TOPIC: Range Searching 


Suppose the 1000 element vector of ages is named AGES. Let us define 
a 10 element vector LOWER of the lower limits for the specified 
ranges: 


LOWER©CO 10 20 25 30 35 40 50 60 65 


We need to compare each element of AGES to each element of LOWER and 
to determine the index into LOWER of the last element which is less 
than or equal to the element of AGES. Outer product may be used to 
solve this directly: 


INDS¢«+/AGES°.2LOWER 


This expression is simple and powerful but suffers from the malady of 
all outer product solutions. Since every element of the left 
argument is being compared to every element of the right argument, 
the number of comparisons increases exponentially as the lengths of 
the two arguments increase linearly. Hence, the solution is slow and 
expensive when performed on two long vectors. 


A more efficient (for long arguments) algorithm can be developed 
using the same sorting technique employed in the prior section. 


1. Combine the two arguments and determine the grade vector: 


A©LOWER, AGES 
GRADE¢4A 
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2. Rather than reorder the elements of the catenated array, reorder 
the elements of an array of 1s and Os where the is mark elements of 
LOWER and the Os mark elements of AGES: 


FLAG¢( CeA) TC oe LOWER) 01) [GRADE J 


3. Determine the index into LOWER of each element of the sorted 
catenated array: 


FIRST¢+\ FLAG 
4. The elements of FIRST correspond to the elements of the sorted 
catenated array. Reorder the elements so they correspond to the 


elements of the unsorted catenated array: 


INDS¢( p FIRST) 00 
INDS[GRADE]¢FIRST 


5. Select only those elements of INDS which correspond to the 
elements of AGES (Cnot the elements of LOWER): 


INDS¢Coe LOWER) LINDS 


The following function LIOTA uses this approach but is extended to 


handle origin O and to return 1 greater than the largest index if the 


corresponding value is less than the smallest lower limit. The left 
argument 1s assumed to be in ascending order. 


C[WSID: SEARCH] 
V INDS¢€LOWER LIOTA VALS;3A3F3G;1I;L 
[1] a Returns the indices of LOWER at which the 
[2] a elements of VALS first match or exceed. 
[3] aA Branch unless right argument empty: 
[4] %CxpVALS)eL1 
C5] INDS€10 
C6] +0 
[7] A Combine arguments and sort: 
C8] L1:G¢s4ACLOWER, VALS 
[9] aA Flag elements from LOWER in sorted array: 
[10] L¢eLOWER 
[11] FeCCeA)TLe1)(G] 
[12] A Determine indices into LOWER (origin dependent): 
C13] FLOTOJ¢FLOLIOJ-~oO1I0 
[14] I¢+\F 
[15] a Unsort indices (to catenated order): 
C16] INDS¢I1 
[17] INDS([GI¢I 
[18] aA Keep those corresponding to right argument: 
[i9] INDS¢LJINDS 
[20] aA Set ‘not found’ indices to 'one greater’: 
[21] INDS[ CINDS=O1I0-1)/tpINDSI©L+OI0 
V 
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The solutions presented here are oriented around lower limits of 
ranges (in ascending order). If upper limits are considered (e.g. 
UPPER¢9 19 24 29 34 39 49 59 64 99), the solutions must be modified 
accordingly: 


Lower limits (ascending): +/AGES°e.2LOWER 
or: LOWER LIOTA AGES 


Upper limits Cascending): 1++/AGES°.>UPPER 
or: UPPER UIOTA AGES 


Lower limits (descending): 1++/AGES°.<LOWER 
Upper limits (descending): +/AGES°.=UPPER 


The following function UIOTA works like LIOTA but requires a vector 
left argument of range upper limits in ascending order. 


CWSID: SEARCH] 
V INDS¢UPPER UIOTA VALS;SA;F;5G;1I;L 
[1] aA Returns the indices of UPPER at which the 
C2] aA elements of VALS last match or are less than. 
[3] a Branch unless right argument empty: 
[4] %C€xpVALS)eL1 
C5] INDS€10 
C6] >0 
[7] a Combine arguments and sort: 
[8] L1:GeAéA¢VALS , UPPER 
[91 aA Flag elements from UPPER in sorted array: 
[10] L¢eUPPER 
[11] F€((-pA)TLe1)[G] 
[12] A Determine indices into UPPER Corigin dependent): 
[13) I¢e+\ 11l0I0,F 
[14] A Unsort indices (to catenated order): 
C15] INDS¢I1 
C16] INDS[G]J¢I 
[17] A Keep those corresponding to right argument: 
[18] INDS€(eVALS)peINDS 
4 


You should be aware that the LIOTA and UIOTA functions may not 
produce correct results when operating on floating point numeric 
vectors whose values are approximately equal (within comparison 
tolerance) to elements in the lower limit or upper limit vector. 

This aberration occurs because the grade-up (4) function used in 
LIOTA and UIOTA does not consider comparison tolerance when sorting. 
Thus, two numbers which would be treated as equal by the relational 
functions (say 2 or >) are treated as distinctly different numbers by 
grade-up. If this is a likely problem for a particular application, 
you should use the appropriate outer product solution. 


Let's consider an alternate algorithm for solving this range 
searching problem. The algorithm involves "ranking vectors". The 
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ranking vector of a vector V is computed via 44V and indicates the 
relative magnitudes of the values of V. The smallest value in V is 
assigned the index 1 (in origin 1), the second smallest the index 2, 
the third 3 and so on. For example, 


A415 5 10 15 20 
3124 5 


Notice that in the event of ties, the earlier values receive the 
lower rankings. In this example, the first 15 is ranked 3rd and the 
next is ranked 4th. 


Consider what happens to the rankings of these values when more 
values are catenated to the vector. For example, 


4415 5 10 15 20,13 17 
41257 3 6 


Notice that the corresponding rankings (4 1 2 5 7) have increased by 
the number of catenated values which are less than the respective 
values. 


4125 7-3 124 5 
10012 


That is, no catenated values are less than the 5 or 10: one catenated 
value (13) is less than the two 15s; and two catenated values (13 17) 
are less than the 20. 

Consider what happens when some of the catenated values are equal to 
values in the original vector. For example, catenating 13 15 instead 
of 13 17: 


4415 5 10 15 20,13 15 
412573 6 


We get the same result. However, notice what happens when the 
catenated values are placed at the front of the vector: 


The result now indicates the number of catenated values which are 
less than or equal to each value, not just less than. 


Given this behavior, the LIOTA and UIOTA algorithms follow directly: 
LIOTA: INDS¢( Coe LOWER) JASLOWER, AGES )-AAAGES 


UIOTA: INDS¢1+(€(CeAGES ) TASAGES , UPPER) -A4AAGES 
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These algorithms produce correct results for origin 1. The LIOTA 
algorithm returns 0 (instead of 1+@LOWER) for values of AGES which 
are less than the smallest value in LOWER. The two functions listed 
below, LIOTA1l and UIOTA1, work like the LIOTA and UIOTA functions 
above but use these ranking vector algorithms. The algorithms have 
been modified to work correctly in either origin and to return the 
correct "not found" value (COIO+eLOWER). 


Further, a more efficient method for computing the ranking vector is 
employed. When sorting a grade vector, traditional sorting logic is 
not needed. Index assignment will suffice. The following four sets 
of expressions generate equivalent results: 


R¢aAaVv GeaV GeAV GeAaVv 
R¢4G R¢€CoV)e0 R€G 
REGIJ€leG REGJ€¢teG 


The last set of expressions is the most efficient. Since it is not 
as clear as the first expression, it should include a comment: 


A REAAV 
R€GeaVv 
REGI¢teG 


Using this technique, the LIOTA1 and UIOTA1 functions each perform 
only two grade-up operations instead of four. However, the LIOTA and 
UIOTA functions above each perform only one grade-up operation and so 
will typically be the faster functions. Time them in your APL 
implementation. 


C[WSID: SEARCH] 
V INDS¢LOWER LIOTA1 VALS;G;L;R;S 
C1] aA Returns the indices of LOWER at which the 
[2] aA elements of VALS first match or exceed. 
[3] L*epLOWER 
[4] A R€EAAVALS : 
[5] R¢G¢AVALS 
C6] REIGJ¢ieG 
[7] A S¢44LOWER, VALS : 
C8] S¢GCALOWER, VALS 
[9] S[G]«ieG 
[10] A Origin 1 indices: 
C11] INDS€C(LJS)-R 
[12] a Set 'not found’ indices to ‘one greater’: 
C13] INDS{£CINDS=0)/LeINDSIJ¢L+i 
[14] aA Change from origin 1 to origin O if needed: 
[15] ~DI0e0 
C16) INDS¢INDS-1 
V 
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CWSID: SEARCH] 
V INDS€UPPER UIOTA1 VALS;3G;R;S 
[1] a Returns the indices of UPPER at which the 
[21 a elements of VALS last match or are less than. 
[3] A REAAVALS : 
[4] R¢GeAVALS 
[5] RUIG]¢1eG 
C6] A S¢AAVALS , UPPER 
C7] S¢+G¢AVALS , UPPER 
C8] SEG]€¢leG 
[97 aA Origin 0 indices: 
[10] INDS<( CeVALS)eS)-R 
[11] a Change from origin 0 to origin 1 if needed: 
C12] 20T0OL0 
[13] INDS€INDS+1 
V 
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PROBLEM: Write a function named ASS (string search) which will 
locate every occurrence of a character vector substring 
(right argument) in a character vector (left argument). 
The result is a Boolean vector of the same length as the 
left argument whose 1s flag the indices at which each match 
begins. For example: 


‘THIS IS A TEST’ ASS ‘IS’ 
001002120000000 0 


TOPIC: Character Substring Searching 
Some APL implementations have primitive functions which solve this 
problem directly: 
APL*PLUS: 
V BIT€CVEC ASS SUB 


C1] BIT¢CVEC OSS SUB 
Vv 
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APL2: 


V BIT¢CVEC ASS SUB 
Cid BIT«SUBECVEC 
V 


If such a primitive function is unavailable to you, you must work a 
little to get the desired result: 


C[WSID: SEARCH] 

V BIT€CVEC ASS SUB;C3;S 
C1] a Returns bit vector of length CeCVEC) with 1s 
[2] a flagging starts of substrings which match SUB. 
[31  CeeCVEC 
[4] S¢o,SUB 
[5] BIT¢CTt(-S)LSUBA.=(S,C+#XC) OCVEC 

V 
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PROBLEM: Write functions REPLACE and BY which will replace all 
occurrences of a character vector substring in a character 
vector by a second substring. For example: 


‘THIS IS A TEST’ REPLACE 'IS' BY 'ARE' 
THARE ARE A TEST 


Use the function ASS defined in the prior section. 


TOPIC: Character Substring Replacement 


The BY function is used simply as a syntactic convenience to provide 
three arguments to the REPLACE function. One approach is to assign 
the right argument to a global variable (say <by>) and to return the 
left argument as the explicit result. 


[WSID: SEARCH] 
V REA BY B 


[1] aA Used in conjunction with REPLACE as: 
[2] A 
C3] A ‘THIS IS A TEST’ REPLACE ‘IS’ BY 'ARE’ 
C4] A 
[5] by¢+B 
C6] R¢A 
Vv 
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The REPLACE function will generate a result by analyzing its two 
arguments and its third global "argument" <by>. When done, REPLACE 
erases <by> so that it will not be left global. 


The approach taken by REPLACE is the following: 


1. Use ASS to find the occurrences of the old substring in the 
character vector. Convert the bits to indices. 


2. Create a replication vector (i.e. left argument to /) which can be 
used to both squeeze out the old substring and to allow room for the 
new substring. Perform the replication on the character vector. 


3. Since the length of the character vector has changed Cunless the 
new substring has the same length as the old substring), adjust the 
indices computed in step 1 to point to where the new substrings must 
be inserted. 


4. Insert the new substrings. 


[WSID: SEARCH] 

V NVECCOVEC REPLACE SUB; BIT; IND;NHITS;3REP;SIZE;O1I0 
[11 a Replaces all occurrences of SUB in OVEC by <by> (set 
[2] aA in BY), erases <by> and returns the modified OVEC. 
[3] aA Requires subfn ASS Cor OSS or é€). 
[4] a The logic is a bit simpler using origin 0: 
[5] OTO<¢0 
[6] a Locate the starts of the old substring: 
C7] BIT€OVEC ASS SUB 
C8] aA Convert the bits to indices: 
[9] NHITS¢eIND¢BIT/teBIT 
[10] A Initialize replication vector as 1s: 
[11] REP¢(eBIT)el 
[12] A Insert Os where old substrings are: 
[13] REPLINDo.+t10,SUB1¢€0 
C14] A Insert new substring length where new substrings 
[15] aA will begin: 
[16] REPCINDI¢SIZE¢o, by 
[17] A Squeeze and expand OVEC with replicate: 
[18] NVEC¢REP/OVEC 
[19] a Adjust old indices to get new indices: 
[20] IND¢IND+(SIZE-oe ,SUB)XLNHITS 
[21] aA Insert new substrings: 
[22] NVECCINDo.+iSIZEJ¢(NHITS ,SIZE) opby 
[23] A Erase <by>: 
[24] BIT¢OEX ‘by’ 

Vv 
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PROBLEMS: (Solutions on pages 334 to 336) 


1. Given a 3 column integer matrix PNUM of telephone numbers (area 
code, phone number, extension, e.g. 213 5550123 1234), how can 
you sort the numbers in ascending order? 


2. Modify the CMIOTA function described in this chapter to define a 
function IOTA which works on numeric vector arguments instead of 
character matrix arguments. Test it in your APL implementation. 
Which is faster, IOTA or dyadic ut (see Computer Efficiency 
Considerations chapter)? What is the consequence of the 
dependence of dyadic 1 on OCT (comparison tolerance) and the 
independence of 4 on OCT? 


3. Given a 3 column integer matrix PNUM of telephone numbers (area 
code, phone number, extension) and a 3 element integer vector P 
which represents a particular telephone number, determine the 
index of the first row of PNUM in which P is located. 


4. Using the LIOTA function developed in this chapter, determine in 
which salary grouping each of the elements of the vector SALARY 
belong. The groupings are: (1) 1000 to 9999; (2) 10,000 to 
19,999; (3) 50,000 to 69,999; (4) 100,000 and up. Return the 
index 5 for elements of SALARY in none of these groupings. 


5. Using the ASS function developed in this chapter, write a monadic 
function DEB which will delete extraneous (leading, trailing or 
contiguous) blanks from its argument and will return the 
compressed result. For example: 


DEB ’ TOO MANY SPACES. 
TOO MANY SPACES. 


6. In a numeric vector NVEC, the value “1 represents "unknown". 
Display NVEC, showing each occurrence of ~1 as the characters 
'N/A’ Cnot applicable). 
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7. Suppose you have a 25 column character matrix of employee names, 
ENAMES. Each row contains one name, left-justified. The names 
contain both uppercase and lowercase letters. Display the names 
which contain the string "son" anywhere in the name. 
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SELECTING 


This chapter deals with the task of data selection in APL. 
Selection is the process of extracting elements from an APL array. 
The reverse process, replacing the values of elements within an 
array, or selection assignment, is also considered. Finally, a 
special selection task is covered: the task of selecting those 
values in an array which are unique (or distinct). 
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PROBLEM: Given a three element vector NVEC, what APL expression will 
return the first two elements? What expression will 
replace these elements in NVEC by the values 10 and 20? 


TOPIC: Selection and Selection Assignment 


There are basically 3 selection techniques available in APL. 


1. Indexing. Use indexing ([1]) when you know the positions within 
the array of the elements to be selected. For example (in origin 1): 


NVEC[1 2] 
2. Take/drop. Use take (T) or drop (1) or both when the elements to 
be selected are contiguous, especially at the start or end of the 
array. For example: 


2TNVEC 
~1LINVEC 
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3. Compression. Use compression (/) when you have a corresponding 
Boolean compression vector whose ones flag elements to be selected. 
For example: 


1 1 O/NVEC 


Typically, the compression vector is the result of a relational or 
logical expression which defines some criteria by which elements are 
to be selected. 


Though there are 3 selection techniques, the only selection 
assignment technique available in APL is index assignment. For 
example: 

NVEC[E1 2]€10 20 


If the nature of the selection assignment problem is oriented more 
toward take/drop or compression logic, you must convert the selection 
values to indices so that you may use index assignment. For example: 


NVECCT2TLPNVECI€10 20 
NVECL 1LteNVECI]€<10 20 
NVECC1 1 O/LeENVECI€¢10 20 


It is because of this need to convert to indices when performing 
selection assignment that the APL idioms tp and /loe are so common. 


In APL2, the APL language has been extended to allow direct selection 
assignment without first converting to indices. For example: 


NVECE1 21€10 20 
C2TNVECI€10 20 

C “1JNVEC)€10 20 

C1 1 O/NVEC)€10 20 


In fact, fairly complex selection assignment expressions are 
permitted. The expression, 


(39 110NVEC)€10 
is equivalent to: 
NVEC[ 3e110lLpNVECI€10 


The enhancement, when not abused, is a welcome extension to the 
language. 
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PROBLEM: Suppose you have constructed a matrix DEPN of annual 
depreciation rates to be used for assets which have 
depreciable lives of 20 years. DEPN has 20 rows and 12 
columns. DEPN[Y;M] is the fraction of the asset to be 
depreciated in the Yth year of its life for an asset 
purchased in month M (1 for January, 2 for February, and so 
on). Suppose YEAR is a 1000 element vector of the ages (1 
to 20) of 1000 assets and MONTH is a 1000 element vector of 
the months of purchase (1 to 12) for the corresponding 
assets. Determine the annual depreciation rates for these 
assets. 


TOPIC: Scattered Point Indexing 


A common mistake made when solving this problem is to try the 
following: 


DEPN{C YEAR; MONTH] 


This expression shows nicely what you want to do but unfortunately 
does not do it. The shape of the result of matrix indexing is the 
catenation of the shape of the row indices with the shape of the 
column indices. Since YEAR has shape 1000 and MONTH has shape 1000, 
the result has shape 1000 1000. These 1,000,000 elements are the 
rates for every combination of the elements of YEAR and the elements 
of MONTH. 


If you can picture this 1000 by 1000 element matrix in your mind's 
eye, you can see that the desired rates are sitting on the diagonal. 
The other rates are superfluous. If you have experimented much with 
dyadic transpose (&), you know that it can return the diagonal 
elements of a matrix argument by providing a left argument of 1 1 (Cin 
origin 1). Therefore, a correct expression to solve this problem is: 


1 1 &DEPN[ YEAR; MONTH] 


Unfortunately, this expression requires room in your workspace for 
the temporary 1000 by 1000 table. This may cause a WS FULL error. 
Even if available workspace is not a problem, the extraction of 
1,000,000 rates when you need only 1000 is extremely inefficient. 


An alternate approach to this problem is to view DEPN as a vector. 
The vector has 240 elements and is derived by raveling the matrix 
DEPN. Our job is to pack the vectors YEAR and MONTH into a single 
vector of indices into the raveled DEPN. The desired indices may be 
computed by the expression, 


MONTH+12xXYEAR-1 
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Thus, the desired result may be computed from the expression, 
(, DEPN) (MONTH+12xYEAR-1] 
This type of problem is called a "scattered point indexing" problem 
because the desired elements to be selected from the matrix are 
scattered throughout it. Normal matrix indexing (MAT[ROWS;COLS]) is 
useful only when the elements to be selected are in a rectangular 
pattern. 
Let us state the scattered point indexing solution in general terms: 
( , MATRIX) [COLUMNINDEX+NUMCOLS xXROWINDEX-1] 


For a 3-dimensional array, the solution is: 


( , ARRAY ) [COLUMNINDEX+ (NUMCOLS xROWINDEX- 1) + C(NUMCOLS xNUMROWS ) x 
PLANEINDEX-1] 


Or: 


( ,ARRAY ) [COLUMNINDEX+NUMCOLS x (ROWINDEX~— 1) +NUMROWS x 
PLANEINDEX-1] 


When performing scattered point indexing in origin 0 (OI0¢0), the 
"-1" portions of the above expressions disappear: 


Matrix Corigin 0): 
(,MATRIX) [COLUMNINDEX+NUMCOLS xROWINDEX] 
3-D array (origin 0): 
(, ARRAY ) [COLUMNINDEX+NUMCOLS xROWINDEX+NUMROWS x PLANEINDEX] 


For this reason, scattered point indexing is frequently done in 
origin 0. 
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PROBLEM: What algorithm may be used to return the unique values (UN) 
from a numeric vector (NV)? The unique values (UC) from a 
character vector (CV)? 


TOPIC: Unique (Distinct) Values 


Determination of the unique, or distinct, values is a common problem 
in the world of data processing. For example, given 1000 sales 
transactions which each include the salesperson number, you may want 
to compile a list of the numbers of the salespeople who had sales. 
If only 60 salespeople accounted for all 1000 sales, you would want 
to determine the numbers of those 60 salespeople. (You might also 
want to know the number of sales and the total dollar value of the 
sales attributed to each salesperson. These topics are covered in 
the next chapter. ) 


To illustrate the algorithms discussed in this section, we will use 
the following vectors: 


NV 

30 20 20 30 10 50 10 10 
CV 

BOOKKEEPER 


Our task is to return the vectors UN and UC: 


UN 
30 20 10 50 
UC 
BOKEPR 


Since the problem of determining distinct values can be viewed as a 
searching problem, the most obvious algorithm uses the APL searching 
primitive, dyadic t. Consider the result when you search the 
elements of a distinct vector for its own elements: 


897152189 7 15 
123 4 


However, if the elements are not distinct, the pattern of the result 
is not so regular: 


NVUNV 
L222 6 So 5 


In fact, wherever the result deviates from the vector of generated 
indices (tpNV), the corresponding element is a repeat value, i.e. has 
occurred earlier in the vector. To flag the distinct values then: 


(NVLNV) =LpNV 
110031100 
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And to select the distinct values: 


O<UN¢ ( C(NVLNV) =LeNV) /NV * Algorithm 1 * 
30 20 10 50 


Notice that this algorithm returns the distinct values in the same 
order as they first appear in the target vector. This algorithm also 
works on character vectors: 


( CCVLCV) =LepCV)/CV 
BOKEPR 


Since this algorithm depends upon the behavior of dyadic 1, it may 
also be used with functions which emulate dyadic 1. For example, if 
your task is to determine the distinct rows in the character matrix 
NAMES, you may do so with the following expression (given CMIOTA from 
the previous chapter): 


C(CNAMES CMIOTA NAMES )=tleeNAMES ) NAMES 


When uSing this algorithm on a large vector (say, 2000 or more 
elements), the dyadic t portion of the algorithm may require a 
Significant amount of processing time. A more efficient algorithm 
may be constructed which uses the (typically very efficient) grade-up 
(4) primitive function. 


Sort the vector: 


O<SORTED¢ENVLE ANVI 
10 10 10 20 20 30 30 50 


Shift the elements of the sorted vector to the right and compare to 
the sorted vector to flag the first distinct value in each run of 
like values: 


~1®SORTED 
50 10 10 10 20 20 30 30 
O¢FIRST¢SORTED# 10SORTED 
1003120101 


Unfortunately, if the values in the vector SORTED are all the same, 
the vector FIRST will be all zeros. Yet the first value of FIRST 
should be 1. The following expression will set the first (in either 
origin) element to 1, and will have no effect if FIRST is an empty 
vector: 

FIRST([ UxXpFIRSTJ]¢1 
Finally, select the first distinct value in each run: 


O<¢UN¢FIRST/SORTED 
10 20 30 50 
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Notice that this algorithm returns the distinct values in ascending 
order (descending if ¥ is used). The entire algorithm follows: 


SORTED¢NVIANV J * Algorithm 2 * 
FIRST¢SORTED# 10SORTED 

FIRSTCUXpFIRSTI¢1 

UN¢FIRST/SORTED 


The algorithm works on character vectors once you manage to sort the 
characters. If your implementation of APL supports dyadic grade-up, 
you may replace the first statement by: 


SORTED¢CV(OAV4CV ] 
If it does not, you may replace the first statement by: 
SORTED¢CV[ANAVLCV] 


As with numeric vectors, the distinct elements in the final result 
are in ascending order (where DAV, the atomic vector, defines the 
collating sequence). 


You may determine the distinct rows (UNAMES) of a character matrix 
(NAMES) using this algorithm if you are able to sort the matrix. If 
your implementation of APL supports dyadic grade-up, do the following: 


SORTED¢NAMES [ DAVANAMES; J 
FIRST¢v /SORTED# ~1eSORTED 
FIRSTL XLpFIRSTI€1 
UNAMES¢FIRST/SORTED 


If your implementation does not support dyadic grade-up, use the 
CGRADEUP function developed in the previous chapter. 


The expression CV[ADAVLCV] in the statement above suggests another 
algorithm for determining the distinct elements of a character 
vector. Consider the meaning of the expression CVLOAV, or better 
still, the expression DAVecV. The result of the latter expression is 
a 256 element Boolean vector that flags the elements of DAV which are 
in CV. Since the elements of OAV are distinct by definition, the 
remaining step is to use the Boolean vector to select the 
corresponding elements from DAV. The complete algorithm: 


UCe(OAVeECV) /OAV * Algorithm 3 * 
Like algorithm 2, this algorithm returns the distinct values in 
ascending order (according to OAV). In some implementations of APL, 
€ is extremely fast on character data. In others, it is not. If 
not, use algorithm 1 or 2. 


Extending this algorithm to numeric (Cor at least integer) data will 
produce ridiculous expressions like: 


UN¢ CC LLES0O) ENV) /1T1E30 
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which will hopefully generate a WS FULL error message rather than run 
into the next century. When we stop to consider the philosophy of 
the algorithm rather than its implementation, however, a very clever 
algorithm emerges. 


The philosophy is to consider the nature of the values in NV. The 
expression above assumes that all elements of NV are integers between 
1 and 1E30 Corigin 1). A more realistic problem would have the 
values between, say, 1 and 100. Let us view these values as indices 
rather than numbers and use them as such. Initialize a Boolean 
vector to have 100 zeros: 


BIT¢10000 


Use the numeric (index) vector to index assign 1s into the Boolean 
vector: 


BITEINVI€1 


Duplicates in NV are essentially discarded. The final step is 
Similar to the final step in algorithm 3: 


UN©BIT/1100 


Notice that this algorithm returns the distinct values in ascending 
order, that it works only on positive integers (O0IO and up), that the 
maximum value (MAX) must be known and not too large (else WS FULL on 
MAXe0). This algorithm is extremely fast on vectors of any size and 
for any APL implementation. The entire algorithm follows: 


BIT¢ (MAX+~0IO) 00 * Algorithm 4 * 
BITCNVIJ¢1 
UN¢BIT/tUeBIT 
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PROBLEM: For each of the unique-value algorithms presented in the 
last section, what additional logic must be included to 
also return the indices (INDS) of the numeric vector (NV) 
or character vector (CV) into the vector of distinct values 
CUN or UC)? For example, suppose NV and UN have the 
following values: 


NV 

30 20 20 30 10 50 10 10 
UN 

30 20 10 50 


The desired value if INDS is: 


INDS 
122313 4 3 3 


TOPIC: Translating Distinct Values to Distinct Indices 


The obvious solution to this problem is: 
INDS¢CUNLUNV or INDS€UCLCV 


These indices are useful when performing frequency counts or 
accumulations. These topics are covered in the next chapter. 


While obvious, the dyadic t approach may not be the most efficient 
technique for computing these indices, depending upon the 
unique-value algorithm being used. 


In the case of Algorithm 1, a dyadic t is already being performed so 
there is no need to do it again. Instead: 


UN¢ (CB¢CI¢NVUNV) =LeNV) /NV * Algorithm 1 * 
INDS€(B\leUN) [TI] 
or: INDS¢€(+\B)CIJ-~0O1I0 


The expansion in the second statement or the cumulative sum in the 
third are typically more efficient than another dyadic lu. 


In the case of Algorithm 2, the grade-up operation removes the need 
to perform dyadic 1. Instead: 


SORTED¢NV[IGeANV] * Algorithm 2 * 
FIRST¢SORTED# ~1¢SORTED 

FIRST[ UXpFIRSTI1¢1 

UN¢FIRST/SORTED 

FIRSTLLXpFIRST1¢OI0 

INDS¢( oFIRST) 00 

INDS(G1¢+\FIRST 
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The fifth statement sets the first element of FIRST to 1 or 0 
depending upon the index origin. The statement is needed only if you 
may be working in origin O since the first element of FIRST will 
already be 1 Cunless FIRST is empty). The plus scan (+\) and 
indexing operations in the last statement are typically more 
efficient than another dyadic 1. The last two statements are 
required to "unsort" the indices so that they are in the order of the 
elements of NV rather than of SORTED. A simpler, though less 
efficient, statement may be used in place of the last two statements: 


INDS¢(C+\FIRST)(C4G] 


In the case of Algorithm 3, there is no way to avoid the dyadic u. 
So: 


UCe (OAVeECV) /DAV * Algorithm 3 * 
INDS€UCLCV 


In the case of Algorithm 4, there is no need to do the dyadic u for 
the same reason there is no need to do any searching when determining 
the distinct values: 


BIT¢ (MAX+~0I0) 90 * Algorithm 4 * 
BITLNVI]€1 


UN¢BIT/lLpBIT 
INDS<(BIT\toeUN) [NV] 
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PROBLEMS : (Solutions on pages 337 to 339) 
1. What APL expression will select every other element (lst, 3rd, 


5th, ...) of a vector V which has an even number of elements? 


2. What APL expression will return a vector of the elements on the 
diagonal of the square matrix M? 


3. What APL expression will return the Nth column of the matrix M as 
a vector, where N is the value of the last element of the vector 
V? 
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4, 


In a character matrix NAMES of passenger names, the last names 
precede the first names and are separated from them by a slash 
(/). What APL expression(s) will replace the slashes with commas 
orn 


As an actuary, you need to determine the mortality rates for a 
set of 500 policyholders. You have a 3 dimensional "select and 
ultimate" table of mortality rates named RATES. RATES has shape 


2 100 16. The first dimension is sex (female, male). The second 
dimension is issue age (0 to 99). The third dimension is 
duration from issue (0 to 15 years). You have three 500 element 


vectors: SEX, IAGE, DUR. The elements of these vectors are in 
1-to-1 correspondence with the 500 policyholders. SEX indicates 
the policyholder’s sex (O=female; 1=male). IAGE indicates the 
issue age (0 to 99). DUR indicates duration from issue (0 to 15 
years). 


A. Construct the 500 element vector MRATES of the mortality rates 
for these 500 policyholders. 


B. Suppose some of the durations from issue (elements of DUR) 
exceed 15. For these policies, use duration 15 (the 
"ultimate" duration) but increase the issue age by the amount 
that the duration exceeds 15. Construct MRATES. 


C. Suppose you receive a memo which informs you that 40 of the 
elements in RATES are incorrect and must be modified. From 
the information in the memo, you construct 4 vectors of length 
40: NEWRATES, the correct rates; NEWSEX, the sexes for the 
new rates; NEWIAGE, the issue ages for the new rates; NEWDUR, 
the durations for the new rates. Insert the new rates into 
RATES. 


Write the monadic function UNQNV which returns the distinct 
elements of its numeric vector argument. Write UNQCV to return 
the distinct elements of its character vector argument. Write 
UNQCM to return the distinct rows of its character matrix 
argument. 


Write the dyadic functions UNQI1 and UNQIO which return the 
distinct elements of their index vector right arguments (origin 1 
or 0 indices respectively). The left argument of UNQI1 or UNQIO 
is a scalar of the number of possible indices. For example, a 
left argument of 5 for UNQIO implies that all indices in the 
right argument are elements of the set 0 1 2 3 4. 


In addition to the distinct values, each function should compute 
the indices of the right argument values into the resulting 
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distinct values. Assign the indices to the global variable 
<ind>. Place lamps (A) in front of the lines which compute <ind> 
so that the indices are not computed unless the lamps are removed. 
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FREQUENCY COUNTS, ACCUMULATIONS AND CROSS-TABULATIONS 


The often used APL expression +/A adds up the elements of the 
array A in such a way that one of the dimensions of A is eliminated, 
or "reduced". If A is a 1000 element vector, the result is a scalar 
whose one element is the sum of the 1000 elements of A. Typically, 
the elements of a vector represent measurements or counts of 
respective real world items. Then, the result of the expression +/A 
represents the sum of the measurements or counts for all items. 


Frequently in the business world, we tend to categorize items rather 
than lump them together. For example, 1000 sales transactions may be 
categorized by retail outlet or by salesperson or by product or by 
day, and so on. In such an environment, we may want to "reduce" the 
1000 element vector of invoice amounts into 10 sums (say, by 
salesperson) rather than into just a single grand total. Sucha 
problem has traditionally been called an "accumulation" problem in 
APL. Naturally, to solve such a problem, we need a corresponding 
1000 element vector whose values indicate the salesperson responsible 
for each transaction (e.g. salesperson number). Such a vector is 
called a "classification" vector. 


By analyzing the classification vector alone, we can answer questions 
such as, "For how many transactions was each salesperson 
responsible?"" Such a problem has been called a "frequency count" 
problem in APL. 


If a second classification vector is available which represents, say, 
day of the week of the sale, we may want to look at sales broken down 
Ci.e. added up) by both salesperson and day of the week. In this 
case, we want to reduce the 1000 element vector of invoice amounts 
into a 10 (salesperson) by 7 (days of the week) matrix. Such a 
problem has been called a "cross-tabulation" problem in APL. 


To avoid ambiguity from existing terminology, we present the 
following terminology and definition: 
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"n-way plus reduction on dimension d of array A by 
classification 1, classification 2, ..., and classification n" 


The summarization of array A across dimension d such that 
dimension d is replaced by n new dimensions whose magnitudes 
are the number of classes defined for the corresponding 
classifications 1, 2, ..., n. 


Let us try this terminology on an example. Suppose you have 
information on 1000 life insurance policies. A subset of the 
information is listed below: 


Issue Underwriting 
Age Sex Class Death Annual 
[O to 99] [M,F] [S,A,B,C,D] Benefit Premium 
(IAGE ) (SEX) CUCLASS ) (DBEN ) C(APREM ) 
36 M Cc 150 130 
27 M S 100 75 
42 Fr S 80 85 
M B 


The names IAGE, SEX, UCLASS, DBEN and APREM represent the names of 
the APL variables containing the corresponding data. Each variable 
is a 1000 element vector. SEX and UCLASS are character vectors and 
the rest are numeric vectors. Here are a few of the plus reductions 
that can be performed on these data: 


1. The O-way plus reduction of APREM. This is simply +/APREM. 


2. The 1l-way plus reduction of APREM by SEX. This is the 2 element 
vector (+/(SEX='M’ )/APREM), (+/(SEX='F’')/APREM). 


3. The 3-way plus reduction of DBEN by AGE, SEX and UCLASS. This is 
a 100 (Cages) by 2 (sexes) by 5 Cunderwriting classes) array whose 
elements contain the sums of the elements of DBEN for each 
combination of AGE, SEX and UCLASS. Notice that the result 
coincidentally has 1000 elements, the same number of elements as 
DBEN, the vector being "reduced". The number of elements has not 
been reduced at all. In fact, if another classification were added, 
the result would contain more elements than the vector being 
reduced. Most of the elements of the result will be 0. The data 
will have become so finely classified that each "cell" Ci.e. 
combination of classes) in the result contains at best a few policies. 


4. The l-way plus reduction of 1 (or (eSEX)e1) by SEX. This is the 
frequency count by sex: (+/SEX='M'),(+/SEX='F’). 
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5. The 2-way plus reduction on dimension 1 of DBEN,[1.5]APREM by AGE 
and UCLASS. This is a 100 (Cages) by 5 Cunderwriting classes) by 2 
(columns...the dimension not reduced) array whose elements contain 
the sum of the elements of DBEN (column 1) and APREM (column 2) for 
each combination of age and underwriting class. 
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PROBLEM: What is the frequency count (or l-way plus reduction of 1) 
by issue age (IAGE) for the above insurance policies? Call 
the result F. Accumulate (Cor li-way plus reduce) APREM by 
the same ages. Call the result A. 


TOPIC: One-Way Plus Reductions 


Let us solve this problem first by using the classical APL approach, 
which is simple but ignores efficiency considerations. The distinct 
issue ages can be determined by using an algorithm discussed in the 
previous chapter: 


DIA¢( CIAGELIAGE) =LeIAGE) / TAGE 


Next, use outer product to compare the vector of issue ages to the 
vector of distinct issue ages: 


Me«DIAoc. =IAGE 
The result is a Boolean matrix with one row per distinct issue age 
(say 40) and one column per policy (say 1000). Each column has 
exactly one 1, marking the distinct age (row) to which that policy 
(column) corresponds. The frequency count by distinct issue age 
follows directly: 

Fe+/M 
and the accumulation of APREM by distinct issue age is not far behind: 

A¢M+. XAPREM 
The dimensions of M (say 40 by 1000) and APREM (say 1000) conform 
along their inner coordinates allowing the matrix multiplication 
(+.x) which reduces the inner coordinates (1000) and returns a vector 


with the same number of elements as M has rows (40). 


This approach is simple. However, it is inefficient and is prone to 
WS FULL errors. Its inefficiencies stem from the many needless 
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comparisons and computations which take place in the °.= and +.x 
functions. Its WS FULL tendencies are caused by the potentially 
gigantic result of the outer product (°.=). 


A more efficient approach uses the sort-and-shift techniques employed 
in the previous chapter. Begin by sorting the ages in ascending 
order (retaining the grade vector): 


GRADE¢4IAGE 
SORTED¢IAGELE GRADE] 


Shift the elements of the sorted vector to the left and compare to 
the sorted vector to flag the last distinct value in each run of like 
values: 


LAST¢SORTED#Z10SORTED 


Unfortunately, if the values in the vector SORTED are all the same, 
the vector LAST will be all zeros. Yet the last value of LAST should 
be 1. To avoid this problem, you may instead use an odd expression 
like the following (which assumes that no policyholder has issue age 
~99)3 


LAST¢SORTED#1!lSORTED, 99 


Sometimes, using an arbitrary number such as ~99 is not feasible. 
Perhaps the vector could contain any conceivable value. The 
following alternate expressions will set the last (in either origin) 
element to 1, and will have no effect if SORTED is an empty vector: 


LAST¢SORTED# 1¢SORTED 
LASTE (~1+pLAST)+iUxpLASTI€1 


Select the last distinct value in each run: 
DIA¢LAST/SORTED 


You may have noticed that we determined the distinct values 
differently than in the previous chapter. There, we constructed a 
bit vector FIRST which flagged the first 1 of each run of like 
values; here, we constructed a bit vector LAST which flagged the last 
1 of each run. The reason for this minor change of algorithm is that 
LAST is more useful for determining the l1-way plus reductions. 
Consider the meaning of the expression: 


CUM¢LAST/tpLAST 

CUM has one element per distinct issue age. The values are the 
cumulative frequency counts (in origin 1). If we can undo the 
cumulative effect, we will have the fregquency counts. Cumulative 
sums are undone by taking the first differences: 


F¢CUM- (eCUM)e0,CUM 
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To accumulate APREM, we use the same approach. Reorder APREM to be 
in 1-to-1 correspondence with the sorted issue ages (SORTED); 
determine the cumulative sum; select the last element for each run: 

CUM¢LAST/+\APREME GRADE ] 
Compute the first differences to get the desired result: 

A+CUM- (oCUM) e0,CUM 
This algorithm is extremely efficient, especially on large vectors. 
As vectors get larger, the required processing time generally 
increases linearly for this grade-up (4) based algorithm; but it 
increases exponentially for the outer product (°.=) based algorithn. 
This algorithm works just as well on character classification vectors 
(e.g. UCLASS or SEX) as it does on numeric classification vectors 
(e.g. IAGE). Just begin by converting the characters to indices. 
For example: 

GRADE¢4’SABCD’ LUCLASS 

or 


GRADE¢'SABCD’ 4UCLASS 


Use the latter expression if your implementation of APL supports 
dyadic grade-up, and the former expression if it does not. 
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PROBLEM: Generate a 3 by 10 by 2 by 5 array (SMRY) which summarizes 
Cor 3-way plus reduces) the above insurance policies by 
age, sex and underwriting class. The definition of SMRY 
follows: 

SMRY[1;;;1] Total death benefits 
SMRY(€2;3;;] Total annual premiums 


SMRY[33:3;3;1] Frequency count (number of policies) 


SMRY[;I3;;] Age group I (1 is 0-9; 2 is 10-19; ...; 
10 is 90-99) 


SMRY[3;3;J;1 Sex group J (1 is male; 2 is female) 
SMRY€33;:K] Underwriting group K (1 is Standard: 2 is A; 
3 is Bs: 4 is C: 5 is D) 
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TOPIC: N-Way Plus Reductions 


This problem is awkward to solve using inner product and outer 
product techniques. The solution is confusing, WS FULL prone and 
inefficient. 


A neater solution arises when you look at the problem backwards. 
Consider the result SMRY. The three elements defined by SMRY[;1;:J;K] 
are affected by just those policies belonging to age group I, sex 
group J and underwriting group K. Conversely, each policy affects 
exactly one set of three elements in the result. 


To simplify the discussion, let’s consider just SMRY[3;;;1]1, i.e. 
frequency counts. The first element of this array represents the 
number of policies in age group 1, sex group 1 and underwriting group 
1. The second element of this (raveled) array represents the number 
of policies in age group 1, sex group 1 and underwriting group 2. 

And so on. The last (100th) element of this (raveled) array 
represents the number of policies in age group 10, sex group 2 and 
underwriting group 5. 


By considering the result as a vector, you may treat this problem as 
a l-way plus reduction (100 classes) rather than as a 3-way plus 
reduction (10 by 2 by 5 classes). All that remains is to pack 
together the values of the three classification vectors (IAGE, SEX, 
UCLASS) such that the resulting packed classification vector has 
values which distinctly identify the cell of the result affected by 
the corresponding policy. 


The ideal packing scheme is one which converts the classification 
values for each policy directly into the index of the affected 
element in the raveled result. For example: 


Raveled 
Result 
IAGE SEX UCLASS Index (RRI) 
36 (4) M (1) C (4) 34 Corigin 1) 
27 (3) M (1) S (1) 21 
42 (5) F (2) S (1) 46 
50°>:C6) M (1) B (3) 53 


The formula being used here is: 
RRICUCLASS INDEX+ (5xSEXINDEX-1)+(C10xIAGEINDEX-1) 


When working in origin 0, the formula is a bit simpler: 
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Raveled 
Result 
IAGE SEX UCLASS Index (RRTI) 
36 (3) M (0) C (3) 33 Corigin 0) 
27 (2) M (0) S (0) 20 
42 (4) F (1) S (0) 45 
50 (5) M (0) B: “€2) 52 


RRI¢UCLASSINDEX+ (5xSEXINDEX) + C10xIAGEINDEX) 


The 5 in the above formula refers to the number of elements in each 
row of the result (i.e. the number of UCLASS classes). The 10 refers 
to the number of elements in each plane (i.e. the number of UCLASS 
classes times the number of SEX classes). 


The computations of UCLASSINDEX, SEXINDEX and IAGEINDEX are 
straightforward: 


HI0O<o0 
UCLASSINDEX¢€’ SABCD’ LUCLASS 
SEXINDEX¢' F’' =SEX 
IAGEINDEX¢L IAGE+10 


The elements of the vector RRI are all integers between 0 and 99. 

You may then use logic from the prior section to determine the 
distinct values in RRI and the corresponding frequencies (1l-way plus 
reduction of 1) and l-way plus reductions of APREM and DBEN. If you 
initialize the result to be an all-zero vector of the desired length 
(the length of the raveled result), you may simply index assign the 
derived frequencies and sums using the corresponding distinct indices 
from RRI. Finally, reshape the result to the proper shape. 


The complete logic follows: 


Aa Use origin 0 throughout: 
OLO<O0 

A Index in result of column affected by policy: 
UCLASSINDEX¢' SABCD' LUCLASS 

A Index of row affected: 
SEXINDEX¢' F’ =SEX 

A Index of plane affected: 
ILAGEINDEX¢€L IAGE+10 

A Index in raveled SMRY[0;;;] affected: 
RRI¢UCLASS INDEX+5xSEXINDEX+2 x IT AGEINDEX 

a Sort result indices in ascending order: 
GRADE¢4RRI 
SORTED¢RRI {GRADE ] 

A Shift to left and compare to flag last distinct values: 
LAST¢SORTEDZ1/SORTED, 1 

A Select last distinct value in each run: 
URRI©¢LAST/SORTED 


-98- 


Chapter 7 


Aa Initialize 3 raveled all-zero arrays for SMRY[03;;3;1], 
A SMRYC1;3;3;] and SMRY[2;;;1]: 
SMRYO¢+SMRY1¢SMRY2€(10x2x5) 00 

A One-way plus reduce DBEN and insert into SMRYO: 
CUM¢LAST/+\DBEN[ GRADE] 

SMRYO[LURRI ]¢CUM-(eCUM)e0,CUM 

Aa Ditto for APREM into SMRY1: 
CUM¢LAST/+\APREM[ GRADE ] 
SMRY1[URRI 1¢CUM-( CUM) 00,CUM 

A Ditto for frequency count into SMRY2 (origin 0): 
CUM¢LAST/ Lp LAST 

SMRY2CTURRI ]€¢CUM-CeCUM)e°1,CUM 

A Catenate and reshape: 

SMRY€3 10 2 5 eSMRYO,SMRY1,SMRY2 

OTO¢1 


The logic above can be shortened somewhat if you choose to view the 


problem as a Single 3-way plus reduction on the last dimension of the 


three row matrix whose rows are DBEN, APREM and all 1s. Then, the 
logic is: 


URRI¢LAST/SORTED 

SMRY€€3,10x2x5)00 
CUM¢LAST/+\ (CDBEN, LOJAPREM,[ 70.511) 3GRADE] 
SMRY[ ;URRI 1€¢CUM- CeCUM)TO,CUM 

SMRY¢3 10 2 5 pSMRY 

OL0O¢+1 


Once this 4 dimensional array has been constructed, many questions 
can be answered by performing a few simple indexing and plus 
reduction operations. For example Corigin 1): 


ley 


How many males and females? 
+/£1]+/T3ISMRY(333;3] 


What is the total death benefit by sex and underwriting class 
(2 row, 5 column matrix)? 


+/C1ISMRYC1;3;3] 


What is the average annual premium by age group and sex (10 
row, 2 column matrix)? 


+/C1TI+/CL4I]SMRYC2 35333] 
What is the death benefit, annual premium and frequency 
breakdown by age group and underwriting class, where ages are 


broken into 5 groups: 0 to 193; 20 to 39; ...; 80 to 99? 


+/(£3] 3 5 2 5 et/C3ISMRY 
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PROBLEM: Generate a 2 by 10 by 5 array (MAX) which contains the 
maximums of (or 3-way maximum reduces) the above insurance 
policies by age, sex and underwriting class. The 
definition of MAX follows: 


MAX[1:;;] Maximum death benefits 
MAX[2;:3;:;1 Maximum annual premiums 


MAX[(;1I:J;K] Maximum death benefit and annual premium 
for age group I, sex group J, underwriting 
group kK. 


TOPIC: N-Way Maximum and Minimum Reductions 


This problem is no different from the problem in the prior section 
except an n-way maximum reduction is being performed instead of an 
n-way plus reduction. Consequently, the same solution works, up to a 
point. That point in the above solution is: 


SMRYO¢SMRY1<SMRY2€C10xX2x5)00 
CUM¢LAST/+\DBENC GRADE ] 
SMRYOCURRI J¢CUM- CeCUM) 60,CUM 


Unfortunately, you may not simply substitute [\ for +\. The 
algorithm happens to work for +\ because of the nature of addition. 
We must find a comparable algorithm which will work for maximum Cand 
hopefully minimum). 


One such algorithm involves the grade-up (4) function. The idea 
behind the algorithm is to sort the values within their respective 
classes (i.e. within like values of SORTED) and then use LAST to 
select the maximum in each class. Stated differently, you must 
perform a two-key sort where RRI is the major key and the data (DBEN) 
is the minor key. Do the minor key first: 

G¢ADBEN 
and then the major key: 

G¢GLARRI(CG] ] 


Use this grade vector to reorder the values, and use LAST to select 
the maximum in each class: 


MAX+LAST/DBEN[G] 
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or quicker: 
MAX¢DBEN[ LAST/G] 


The final step is to index assign MAX into the result variable 
(MAXO). When performing the n-way plus reduction, we initialized 
SMRYO as all zeros so that those classes which were not represented 
by any policies (i.e. cells not index assigned) would show a plus 
reduction result of zero (just as +/10 is 0). In mathematical 
terminology, the "identity element" of plus is 0. Likewise, you 
should initialize MAXO as a vector filled with the identity element 
for maximum, which is negative infinity and is returned as nearly as 
possible by the expression [/10. The rest of the solution is: 


MAXO¢MAX1€(10x2x5)oI./10 
MAXOLURRI ]+MAX 


G¢4APREM 
G¢GLARRICG]] 
MAX¢APREM[ LAST/G] 
MAX1(URRI ]©MAX 


MAX¢2 10 2 5 eMAXKO,MAX1 
OTO¢+1 


A different algorithm solves the problem without the use of 

grade-up. It uses instead maximum scan (f\). In order for maximum 
scan to be useful, the values in each subsequent class must first be 
shifted up the number scale so that the maximum value of an earlier 
class does not shadow the maximum value of a subsequent class. To 
illustrate, suppose the values of LAST and DBEN[IGRADE] are as follows: 


LAST 
Oo 0 1 0 0 0 212 0 1 #21 
DBENC GRADE J 


20 10 15 15 35 20 25 20 25 30 


Suppose we add 100 to each value in the first class, 200 to the 
second class, 300 to the third class and 400 to the fourth class: 


120 110 115 215 235 220 225 320 325 430 
The maximum scan of this vector is: 

120 120 120 215 235 235 235 320 325 430 
Using LAST to select from this vector produces: 

120 235 325 430 


Subtracting 100, 200, 300 and 400 from the respective classes gives 
the desired maximums by class: 


20 35 25 30 
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For this logic to work, you must shift (add to) the values of each 
class an amount which is at least equal to the difference between the 
maximum value in the preceding class and the minimum value in this 
Class. Since your task is to determine these very maximums, you 
cannot use these precise numbers. Rather, you can determine the 
difference between the maximum and minimum values for the entire data 
vector and use that amount. 


The following are expressions which implement this algorithm as well 
as numbers which illustrate the procedure: 


D¢DBEN[T GRADE ] 20 10 15 15 35 20 25 20 25 30 
DIF¢Cf/D)J-L/D 25 

FIRST¢ 10LAST 1o0o0o21200010i1 
T¢+\FIRST\DIF 25 25 25 50 50 50 50 75 75 100 
USCLAST/T 25 50 75 100 

R¢D+T 45 35 40 65 85 70 75 95 100 130 
R¢T\R 45 45 45 65 85 85 85 95 100 130 
MAX¢€ (CLAST/R)-U 20 35 25 30 


The solution to the above problem using this algorithm can be written 
as follows: 


URRI¢LAST/SORTED 
MAXO¢MAX1¢(10x2x5)o./10 


D¢DBEN[E GRADE ] 

DIF¢CF/DI-L/D 

T¢e+\C “1OLAST)\DIF 
MAXOCURRI]¢(CLAST//I\D+T)-LAST/T 


D¢APREML GRADE] 

DIF¢Cl/DJ-LA\D 

T¢+\C 10LAST)\DIF 

MAXI CURRIJ¢€CLAST/[\D+T)-LAST/T 


MAX¢2 10 2 5 pMAX0,MAX1 
OIO¢+1 


You can do both DBEN and APREM at once with some minor modifications: 


URRI¢LAST/SORTED 
MAX€(2,10x2x5)o./10 


D¢«C(CDBEN,£C~0.5]APREM)[ ;GRADE] 
DIF¢e¢Cl/D)-L/D 

T¢DIFo.x+\" 10LAST 

MAX sURRIJ¢(CLAST/[\D+T)-LAST/T 


MAX¢2 10 2 5 oMAX 
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There are potential problems with the maximum scan algorithm. If 
there are many classes (i.e. 1s in LAST) and if the difference 
between the maximum and minimum values of the data vector is large, 
the values in the vector on which the maximum scan is being performed 
will become immense. If they become too large (beyond 16 or so 
Significant digits), precision will be lost and the results may be 
incorrect. Even if precision is not lost, the numbers may get large 
enough to require internal floating point (usually 8 bytes per 
element) representation rather than integer (usually 2 or 4 bytes per 
element) representation. Consequently, the chances of a WS FULL 
error will increase and processing speed will decline since floating 
point numbers require more processing time than do integers. 


Despite these potential problems, the maximum scan algorithm is a 
useful and efficient algorithm. 


If the problem at the beginning of this section were stated as a 
minimum reduction problem rather than a maximum reduction problem, 
the same two approaches could have been taken, with slight 
modifications. Here are the solutions: 


[grade-up algorithm] 


* 
* 


URRI¢LAST/ SORTED 
MINO¢MIN1¢(10x2x5)eL/10 


G¢YDBEN 
G¢GCA4ARRICG]] 
MINOCURRI J1¢DBENC LAST/G] 


G¢YAPREM 
G¢GC4RRICG]I] 
MINICURRI]J©APREM(E LAST/G] 


MIN¢2 10 2 5 eMINO,MIN1 
OLO¢+1 


[minimum scan algorithm] 


URRI¢LAST/SORTED 
MIN€(2,10x2x5)oL/10 


D¢(DBEN, [0.5 JAPREM)[E ;GRADE] 
DIF¢Cl/DJ-L/D 

TeDIF°o.x+\ 1OLAST 

MIN(C ;URRI1J¢CLAST/L\D-T)+LAST/T 


MIN¢2 10 2 5 oMIN 
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PROBLEM: Suppose the variable ACTIVE is a Boolean vector with the 
same length as IAGE, SEX, DBEN, ... whose values indicate 
whether the corresponding policies are active. Generate a 
10 by 2 by 5 Boolean array (ALL) whose elements indicate 
whether (1) or not (0) all of the insurance policies (as 
defined above) are active within each possible age, sex and 
underwriting class. That is, perform the 3-way 
and-reduction (A/) of ACTIVE by age, sex and underwriting 
class. 


ALL[(I:J;K] The and-reduction of ACTIVE for age group I, 
sex group J, underwriting group K. 


TOPIC: N-Way Logical Reductions 


Once again, the solution to this problem is Similar to that of an 
n-way plus reduction. However, you need to devise an algorithm for 
A/ comparable to that for +/: 


SMRYO0¢€(010x2x5)00 
CUM¢LAST/+\DBEN[I GRADE] 
SMRYOCURRI 1<CUM-(€o0CUM) e0,CUM 


Unfortunately, you may not simply substitute A for + in the above 
logic. One simple solution may be derived from the knowledge that 
the A/ is true if the +/ equals the frequency count. As with the [/ 
and L/ problems of the last section, you must begin by initializing 
the result with the identity element for the reduction function. The 
identity element for A is one, i.e. 1=A/10. The explanation for the 
identity element is: for any Boolean array B, (14B) and (BA1) always 
return exactly B, so 1 is the identity element for Aa. The algorithm 
is: 


ALL€(10x2x5)oe1 
CUM¢ (LAST/ ie LAST) -LAST/+\ACTIVEI[ GRADE] 
ALLCURRI 1¢CUM= (CUM) 9~1,CUM ("1 for origin 0) 


A second algorithm takes advantage of the fact that (A/B) produces 
the same result as (~v/~B). To perform the v/, we need not compute 
the frequency count. Instead, we know that the v/ is true if the +/ 
is not equal to 0. The algorithm is: 


ALL€(10x2x5)o1 


CUM¢LAST/+\~ACTIVEL GRADE ] 
ALL[CURRI ]©CUM=(€CeCUM)00,CUM 
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A third algorithm may be employed which is based entirely on Boolean 
techniques. Since these techniques are discussed in the Boolean 
Techniques chapter, the algorithm is presented here without 
explanation: 


ALL€(10x2x5)ol1 

Be ACTIVEL GRADE] 

CUM¢ (LAST=2B)/LAST 

ALL(CURRI ]<(LAST/B)ACUM/ ~10CUM 


Notice that this algorithm uses no arithmetic function and makes 
heavy use of Boolean functions (2, A and /). In some implementations 
of APL, Boolean functions have been optimized to be extremely fast. 
In such implementations, the third algorithm will dramatically 
out-perform the first two. You should time the alternative 
algorithms in your own environment before deciding among them. 


If the problem at the beginning of this section were stated instead 
as an or-reduction (any) problem rather than an and-reduction (all) 
problem, similar approaches could have been taken, with slight 
modifications. Here are the solutions: 


[sum algorithm] 
ANY<(€10x2x5)00 (note: O=v/10) 
CUM¢LAST/+\ACTIVEC GRADE ] 
ANY LURRI ]©CUM# (oCUM) e0, CUM 
[Boolean techniques algorithm] 
ANY€(10x2x5)00 
BeACTIVEL GRADE] 


CUM¢ (LASTVB)/LAST 
ANYCURRI]¢CLAST/B)2=CUM/  1®CUM 


Na Os BR IND ANG Pp IN I Ns PN Ns INS Ps PN Oe PNAS NG Pe A OO Ra A 


PROBLEM: Define the syntax of utility functions for performing 
frequency counts, accumulations and cross-tabulations in 
APL. 


TOPIC: N-Way Reduction Utility Functions 
We will approach this problem by first imagining an extension to APL 


which can solve the problems of the previous sections. Imagine an 
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extension to the definition of the primitive APL reduction operator 
(@/) so that it will accept the following dyadic syntax: 


r¢(dshape;:civeci;civec2;...;civecN)@/[dlarray 


array = array being reduced; 
d = dimension of array being reduced; 


Ashape = the shape to which dimension d is "reduced"; dshape has 
N elements if an N-way reduction is to be performed; 


civeci = the class index vector for classification i (1si<£N for 
an N-way reduction); civeci has the same length as 
dimension d and its values are indices (origin 
dependent) which identify the class indices into which 
the corresponding dimension d arrays are to be grouped; 
the values of civeci are all elements of tdshape[il; 


r = the N-way (where N=oedshape) @ reduction (where @ is any 
scalar dyadic function) on dimension d of array by 
classifications cvecl, cvec2, ..., cvecN. 


This syntax calls for multiple left arguments (N+1 for an N-way 
reduction) where the arguments are separated by semicolons (;) and 
are enclosed in parentheses. Since the monadic form of reduction is 
a O-way reduction, it is equivalent to the dyadic form in which a 
Single empty vector left argument is provided. 


r¢(10)@/Cdlarray > r¢@/Cdlarray 


Note that the dyadic reduction functions (except O-way reduction) are 
origin sensitive since the left arguments contain indices. 


We may illustrate the use of this syntax by using it to solve the 
problems of the previous sections. We assume origin 1. 


One-Way Plus Reductions: 


DIA¢( (IAGELIAGE) = Le IAGE) /IAGE 
IAGEIND¢DIALIAGE 
F¢(pDIA;IAGEIND)+/1 

A¢(eDIA; IAGEIND) +/APREM 
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N-Way Plus Reductions: 


UCLASSIND¢€'SABCD’ LUCLASS 

SEXIND¢ 'MF' tSEX 

TAGEIND¢1+L TAGE+10 

SMRY€C10 2 5;IAGEIND;SEXIND;UCLASSIND)+/DBEN,[1JAPREM,[.5]1 


N-Way Maximum and Minimum Reductions: 


MAX¢€(10 2 5;ITAGEIND;SEXIND;UCLASSIND) I /DBEN,L.5JAPREM 
MIN€(10 2 5;ITAGEIND;SEXIND;UCLASSIND)LZDBEN,[1.5]APREM 


N-Way Logical Reductions: 


ALL€(10 2 5;ITAGEIND;SEXIND;UCLASSIND)JA/ACTIVE 
ANY€(€10 2 5; IAGEIND;SEXIND;UCLASSIND) V/ACTIVE 


The syntax of dyadic reduction as described in here is adequate as a 
model for user-defined utility functions. Unfortunately, current APL 
systems do not allow multiple left arguments Cunless they are packed 
together into a single "nested" array.) Also, optional arguments 
Ce.g. the d in +/{d]) are not allowed. Therefore, we must make 
compromises. 


One possible syntax is the following: 


r¢(dshape,cind1,cind2,...,cindN,{d})} PLUSRED array 
MAXRED 
MINRED 
ANDRED 
ORRED 
where: 
array = array being reduced: 
ad = dimension of array being reduced; 

Ashape = the shape to which dimension d is "reduced"; dshape has 
N elements if an N-way reduction is to be performed; d 
is optional and defaults to the last dimension of array 
if omitted; 

cindi = the numeric suffix of the name of the global class index 


vector for classification i (1s1isN for an N-way 
reduction); the name of the vector is 'I’,écindi (e.g. 
I3 or 16); the class index vectors have the same length 
as dimension da of array and their values are indices 
(origin dependent) which identify the class indices into 
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which the corresponding dimension d arrays are to be 
grouped; the values of the class index vectors are all 
elements of tdshapel[il; 


r = the N-way (where N=oedshape) @ reduction (where @ is + 


for PLUSRED, [. for MAXRED, ...) on dimension d of array 
by the classifications identified by cindi, cind2, ..., 
cindN. 


In the case of a scalar right argument (e.g. 1), PLUSRED replicates 
the scalar to a vector with the same length as the class index 
vectors. The effect is to return a frequency count by class (times 
the value of the scalar). 


We may illustrate the use of these utility functions by using them to 
solve the problems of the previous sections. We assume origin 1. 


One-Way Plus Reductions: 


DIA¢( (ITAGELIAGE) =LpIAGE) / TAGE 
IT1¢DIAl TAGE 

F¢CCeDIA),1) PLUSRED 1 
AcCCeDIA),1) PLUSRED APREM 


N-Way Plus Reductions: 


I2¢'SABCD’ LtUCLASS 

I3¢’'MF’' iSEX 

I4¢1+LIAGE+10 

SMRY€€10 2 5, 4 3 2) PLUSRED DBEN,[1JAPREM,([0.5]1 


N-Way Maximum and Minimum Reductions: 


MAX¢(10 2 5, 


4 ) MAXRED DBEN,[0.5JAPREM 
MIN¢(10 2 5, 4 


3 2 
3 2, 1) MINRED DBEN,[1.5]APREM 


N-Way Logical Reductions: 


ALL€(10 2 5, 4 3 2) ANDRED ACTIVE 
ANY¢€€10 2 5, 4 3 2) ORRED ACTIVE 


Notice that the following pairs of expressions produce identical 
results: 
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(10) PLUSRED array € +/array 
ONIO PLUSRED array <> +/array 
2 PLUSRED array €> +/L2jarray 


The writing of these utility functions is left as an exercise at the 
end of this chapter. 


PROBLEM: Suppose you wish to compute the 3-way plus reduction by 
age, sex and underwriting class of death benefits, annual 
premiums and frequency counts (as presented in the N-Way 
Plus Reduction section above.) How would you do this on 
one million policies? 


TOPIC: N-Way Reductions on Files 


So far, we have considered only arrays which can be easily 
manipulated within the active workspace. In this problem however, we 
would need several 1,000,000 element vectors: IAGE, SEX, UCLASS, 
DBEN, APREM. In many implementations of APL, the active workspace is 
not large enough to contain these variables. They must be broken 
into smaller pieces and stored ona file. 


The chapter, File Design and Utilities, discusses the application of 
APL files for storing and manipulating large amounts of information. 
In that chapter, a number of alternative file organizations are 
described. One of them, the multi-set transposed file organization, 
is a likely candidate for storing the one million policies introduced 
above. Using that organization, the information is broken into 
smaller pieces and each piece is stored as a single file component. 
Suppose the pieces are 5000 element vectors. The file will consist of 
1000 components (200 sets of 5 variables), each component containing 
a 5000 element vector. 


To compute the required 3-way reduction, you will read in one set of 
the 5 variables (i.e. 5000 policies) at a time and apply the PLUSRED 
function on them. Accumulate the results as you go. After 200 
iterations (i.e. sets), you will be done. 


The file utility function EXECUTE is designed for this type of 
problem. It reads from file one set of information at a time for 
specified variables and then performs any specified computations on 
those variables. The left argument of EXECUTE identifies the file 
being used (FP) and the variables ("fields") required. The right 
argument of EXECUTE is a character vector representation of an APL 
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expression to be executed once for each set. The variables Fl, 
F2,... (where the n in Fn is included in the list of field numbers in 
the left argument) are assigned the values of the respective fields 
for the current set. 


Suppose the variables required for this problem are located in the 
following fields of the file: 


Field 
Number Variable 
3 TAGE 
4 SEX 
9 UCLASS 
12 DBEN 
13 APREM 


You can add up the APREM variable (field 13) for all records on file 
with the following statements: 


SUM¢0 
(FP,13) EXECUTE 'SUM¢SUM++/F13' 


The EXECUTE function will execute the expression SUM¢SUM++/F13 once 
for each set on file. Before executing the expression, it will read 
from file the 5000 element vector of APREM values for the current set 
and will assign it to the variable name F13 (because the number 13 is 
in the left argument of EXECUTE). 


To perform the desired 3-way reduction, we write the following 
function: 


V SUM¢XTAB;13;14;19 
Returns the 3-way plus reduction by age, sex and 
underwriting class of death benefits, annual 
premiums and frequency counts. Requires globals: 
F3 CIAGE), F4 (SEX), F9 CUCLASS), F12 (CDBEN), 
[5] F13 (APREM). Requires fn: PLUSRED. Origin 1. 
C6] Which age class? 
[7] I3€1+LF3+10 
[8] aA Which sex class? 
[9] Il4¢'’MF'tuF4 
[10] A Which underwriting class? 
C11] I9¢’SABCD'tF9Y 
[12] A Perform the 3-way plus reduction: 
C13] SUM€(10 2 5, 3 4 9) PLUSRED F12,(011F13,£0.5]1 

V 


C1] 
C2] 
C3] 
C4] 


DDDDD D 


Then we use EXECUTE to execute this function once for each set of 
5000 policies: 


SUM¢0 
CFP,3 4 9 12 13) EXECUTE 'SUM¢SUM+XTAB’ 
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PROBLEM: Suppose you need to perform a 2-way plus reduction by A and 
B, another by B and C, and another by A and C. You could 
perform the 3-way plus reduction by A, B and C and then 
respectively plus reduce dimensions 3, 1 or 2 of the result 
to generate the desired arrays. Taking this approach to 
its extreme, you would always perform a single n-way plus 
reduction by every possible classification and then use 
monadic plus reduction to eliminate those dimensions not 
needed for particular reports. What are the problems which 
arise from taking this extreme approach? How can the 
problems be overcome? 


TOPIC: Milky-Way Reductions 


We define a new term: 


Milky-Way reduction: an n-way reduction in which you reduce by 
every classification variable under the sun, generating a result 
which may have an astronomical number of elements. 


Performing Milky-Way reductions can improve your productivity and 
reduce computer processing time consumed. Compare the following: 


17€1+lIAGE+10 Cby IAGE) 

I8¢'MF’ LSEX Cby SEX) 

I9€'’SABCD' LUCLASS (by UCLASS ) 
Milky-Way: SMRY€C€10 2 5, 7 8 9) PLUSRED APREM 


By SEX and UCLASS: 


N-way: (2 5, 8 9) PLUSRED APREM 
Milky-Way: +/(C€11]SMRY 


By IAGE and UCLASS: 


N-way: (10 5, 7 9) PLUSRED APREM 
Milky-Way: +/(£21SMRY 


By Sex: 


N-way: (2, 8) PLUSRED APREM 
Milky-Way: +/(€11+/(€31SMRY 


Unfortunately, the number of elements in the result of a Milky-Way 
reduction may be astronomical. Your active workspace may not be big 
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enough to contain them. Even if it can contain them, the process of 
plus reducing the array to a manageable size will be costly and time 
consuming because of the tremendous number of values. 


Tronically, the majority of the values are zeros. For example, if 
you perform a 7-way reduction in which the 7 classifications 
respectively involve 5, 6, 7, 8, 9, 10 and 11 classes, the result (on 
a vector) will contain 1,663,200 elements (5x6x7x8x9x10x11), 
regardless of the length of the vector being reduced. If the vector 
contains 5000 elements, the result will contain at most 5000 nonzero 
values. If the entities represented by the 5000 elements are 
somewhat similar to one another, many of the entities will be 
classified the same. Then there will be fewer than 5000 nonzero 
values in the result, perhaps much fewer. 


If we discard the zeros and retain just the nonzero values, the 
result of a Milky-Way reduction is more manageable. Of course, we 
must keep track of where the nonzero values belong in the result. 


We propose the following functions: 


Ar¢(dshape,cind1,cind2,...,cindN,{d}) APLUSRED array 
AMAXRED 
AMINRED 
AANDRED 
AORRED 


The meaning and syntax of these functions are identical to those of 
the PLUSRED, MAXRED, MINRED, ANDRED and ORRED functions introduced 
earlier in the chapter. The only difference between those functions 
and these is the result. The result of these functions is a 
"compressed Milky-Way array" which is a numeric vector whose elements 
are defined as follows (origin 0): 


ArCO] R -- rank of the array right argument 
ArC1l] D -- dimension being reduced (origin 0) 
Arf2] N -- shape of dshape, i.e. the N for an N-way 
reduction 
Ar[3+lR] S -- shape of the reduced, but not expanded 
(i.e. zero-filled), result 
Ar[(3+R)+tN] DS -- resulting shape of dimension reduced, 
i.e. dshape 
Ar((3+R+N)+tS([D]] RIND -- indices (origin 0) into the raveled 


dshape dimensions of the existing (i.e. 
nonzero) values 
Ar((3+R+N+S[D])+tx/S] DATA -- raveled existing (i.e. nonzero) values 


Since the results of these functions are not in very useful forms, we 
need another set of utility functions to convert them back to the 
more familiar multi-dimensional forms (including zeros where 
appropriate). We propose the following functions: 
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reways APLUSWAY Ar 
AMAXWAY 
AMINWAY 
AANDWAY 
AORWAY 


The right argument of these functions is a compressed Milky-Way array 
as returned by the corresponding functions APLUSRED, AMAXRED, 
AMINRED, AANDRED or AORRED. The left argument is a vector of the 
ways (i.e. indices from tN for an N-way reduction) to be returned. 
The result is the normal X-way reduction (where X=oe,ways) as returned 
by the functions PLUSRED, MAXRED, MINRED, ANDRED or ORRED. 


For example: 


A€(5 6 78 9 10 11, 12 3 4 5 6 7) APLUSRED NVEC 


OTO¢+1 

02 4 6 APLUSWAY A 
6 8 10 

01 7 APLUSWAY A 
5 il 

o2 APLUSWAY A 
6 

oe(l0) APLUSWAY A 
0 


Finally, we need to address the problem of performing Milky-Way 
reductions on files. When using PLUSRED, we execute an expression 
like the following, once for each set of, say, 5000 records on file: 


SUM¢SUM+A PLUSRED B 


This solution will not work with the Milky-Way reduction functions 
Since the values in SUM (after the first set) and the values in the 
result of APLUSRED are not corresponding data values. They are, 
instead, both compressed Milky-Way arrays. We propose the following 
functions: 


AC€AA APLUS AB 
AMAX 
AMIN 
AAND 
AOR 


These functions perform the +, [, lL, A and v functions, respectively, 
between two compressed Milky-Way arrays, returning a compressed 
Milky-Way array. If either argument is an empty array, the other 
argument is returned. 


The solution for working with files, then, needs to be modified only 
slightly when performing Milky-Way reductions: 


ASUM€LO (before loop) 
ASUM¢ASUM APLUS A APLUSRED B (within loop) 
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The writing of these functions is left as an exercise below. 
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PROBLEMS : (Solutions on pages 340 to 361) 


1. Given a 500 element character vector TZONE whose values represent 
the time zones (E, C, M, P, H) in which each of your 500 fast 
food restaurants are located, how many restaurants are located in 
each time zone? 


2. If SALES is a 500 element numeric vector whose values represent 
the annual sales, in dollars, of the corresponding 500 
restaurants of the prior problem, what are the annual sales by 
time zone? 


3. If TYPE is a 500 element character vector whose values represent 
the restaurant types (B, C, P, S) of the corresponding 500 
restaurants, how many (FRQ) restaurants of each type are located 
in each time zone? What are the annual sales (AMT) for each of 
these type/zone breakdowns? How large (MAX) was the largest 
restaurant in each of these type/zone breakdowns? 


4. Write one or more of the utility functions PLUSRED, MAXRED, 
MINRED, ANDRED, ORRED defined in this chapter. Compare your 
functions to the listings of those functions included in the 
solutions at the back of the book. 


5. Write one or more of the utility functions APLUSRED, AMAXRED, 
AMINRED, AANDRED, AORRED, APLUSWAY, AMAXWAY, AMINWAY, AANDWAY, 
AORWAY, APLUS, AMAX, AMIN, AAND, AOR defined in this chapter. 
See the listings at the back of the book. 
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6. Given the 500 element vectors TZONE, SALES and TYPE defined in 
the questions above, suppose you also have the following data: 


STATE 


MGR 


FIT 


500 row, 2 column character matrix of state (postal code) 
abbreviations indicating the states in which the 
corresponding 500 restaurants are located. 


500 element integer vector whose values represent the 
regional managers responsible for the corresponding 500 
restaurants; there are 6 managers and their respective 
numeric codes are 301, 304, 310, 322, 329 and 333. 


500 element numeric vector whose values represent the 
annual federal income tax, in dollars, of the 
corresponding 500 restaurants. 


Using the utility functions presented in this chapter, generate 
the following information: 


Ls 


Number of restaurants whose annual sales were $0 to $1 
million, #1 million to #5 million, $5 million and up. 


Number of restaurants, total sales and total FIT by state 
(given a 50 row, 2 column matrix ALLSTATES of the distinct 
state postal codes). 


Total sales and total FIT by manager, annual sales volume 
(O-1, 1-5, 5+ million) and type of restaurant. 


Number of restaurants by state and type. 


FIT by type and sales volume. 
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WRITING USER-FRIENDLY INTERACTIVE FUNCTIONS 


An interactive function is one which "prompts" you to enter 
information at the keyboard. In this chapter we discuss the 
primitive capabilities available in APL for writing interactive 
functions. We also define utility functions which make the primitive 
functions easier to apply and friendlier to use. 
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PROBLEM: Write a function ASKUSER which prompts you to "ENTER 
EMPLOYEE NAME", assigns the character vector response to 
the variable NAME, prompts to "ENTER SALARY" and assigns 
the numeric scalar response to SALARY. 


TOPIC: Primitive Interactive Functions 


The primitive niladic APL functions O (quad) and 0 (quote-quad) allow 
user interaction. When invoked, both functions causes execution to 
pause while you type information at the keyboard. When you press the 
RETURN (or ENTER) key, the typed information is returned explicitly 
and execution resumes. Graphically, the O and @ represent "windows" 
through which information passes to the computer from the outside 
world. In general, 0 is used to enter numeric information and 1 is 
used to enter character information. 


A simple solution to the stated problem follows: 


V ASKUSER 
C1] ‘ENTER EMPLOYEE NAME’ 
C2} NAME<¢O 
C3] "ENTER SALARY’ 
[4] SALARY¢UO 
V 
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On the 1st and 3rd lines, the prompts are expressed as constant 
character vectors. Since the vectors are not assigned to variable 
names or otherwise used as arguments to functions, they are 
displayed. This behavior is one of the great simplifications in 

APL: to generate output, just construct an array (as a constant or 
as the result of an expression) and do not assign it to a variable or 
otherwise use it. This is why the expression 2+3 causes 5 to display 
but the expressions A€2+3 or 6x2+3 do not cause 5 to display, even 
though the same 2+3 operation is being performed. 


This convention for generating output in APL is a mixed blessing. 
While it is simple to generate output, it is sometimes unclear from 
context whether or not output is being generated. For example, does 
the following function line generate output? 


[15] CRUNCH I 


The answer depends upon whether or not the monadic function CRUNCH 
returns an explicit result. If so, the result is not being assigned 
and so will be displayed. If not, nothing will appear Cunless output 
is generated during the execution of CRUNCH). Because of this lack 
of clarity, some APL programmers choose to show output explicitly by 
assigning it to O. For example: 


C15] O€¢CRUNCH I 


Note that the "window" analogy still holds when using O in this 
context. Now information 1s passing from the computer to the outside 
world. Not only does the O€ convention add clarity, it also enables 
you to locate (under program control) occurrences of output in case 
you wish to direct output elsewhere (say, replace ‘Oe’ by ‘OUTPUT ') 
or turn it off altogether (say, replace ‘Oe’ by '0O 0p’). 


Using this convention, let us rewrite our simple solution: 


V ASKUSER 
C1] O¢’ENTER EMPLOYEE NAME’ 
[2] NAME¢O 
C3] O¢’ENTER SALARY’ 
[4] SALARY<¢DO 
V 


When executing this function, you will see the following: 


ENTER EMPLOYEE NAME 


where the "_" symbol represents the location of the cursor (Cor print 
mechanism) while the computer is awaiting your response. 


Why is the cursor located at the beginning of the line below the 
prompt? Output in APL (via O€ or automatic output) is automatically 
followed by a "carriage return" (i.e. a newline). The only way to 
Suppress the succeeding carriage return is by assigning the output to 
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NM, the other "window". In fact, the inclusion or exclusion of the 
carriage return is the only difference between O¢ and M¢. For 
example, consider the following function: 


V DISPLAY 
C1] O¢'VALUES: ' 
[2] fe2 3 4 
[3] Oe’.!' 

V 


When you execute this function, you will see the following: 


VALUES: 2 3 4. 


Notice that the last statement uses O¢, causing the cursor to move to 
the start of the next line for any subsequent output (including the 6 
space indent you get in immediate execution mode). 


How can we modify our solution to display the prompt and leave the 
cursor beyond the prompt on the same line? 


ENTER EMPLOYEE NAME: 


Given the behavior of M¢ described above, we are compelled to try the 
following: 


[1] O¢’ENTER EMPLOYEE NAME: ' 
C2] NAME<¢f 


Sure enough, it behaves as we want it to: 
ENTER EMPLOYEE NAME: LANDER, KEVIN 


Unfortunately, when we check the value of the variable NAME, we find 
that it contains not only the name but the prompt as well: 


oNAME 
34 
NAME 
ENTER EMPLOYEE NAME: LANDER, KEVIN 


The O function returns every character on the line, whether put there 
by O¢ or by user entry. (Some APL systems return the M¢ characters 
as blanks or stars or other designated characters.) Clearly, we need 
to drop the prompt characters from the result of QO: 


C1] O¢’'ENTER EMPLOYEE NAME: ' 
C2] NAME<¢21/0 


The number 21 is the length of the prompt (including the trailing 
blank). 
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Can we apply the same technique to the SALARY prompt? No. Quad (QO) 
input causes the characters "0:'" to display. For example: 


ENTER SALARY 
O: 


If we use [¢’ENTER SALARY: ', all we accomplish is to put the "O:" at 
the end of the prompt line: 


ENTER SALARY: QO: 


If we want to eliminate the "0:" from the prompt, we must use 0 
input. However, the result will then be character valued. To 
convert the characters to the numbers they represent, we must use 
execute (#). Our final solution is therefore: 


V ASKUSER 
[1] O¢'’ENTER EMPLOYEE NAME: ' 
[2] NAME¢21/0 
[3] M¢’ENTER SALARY: ' 
[4] SALARY¢€?¢1410 
V 


PROBLEM: Write utility functions which prompt for character vectors 
or numeric vectors and which handle the problems associated 
with each. 


TOPIC: Utility Interactive Functions 


First, we will define a monadic function CPROMPT (character prompt) 
which will display its character vector right argument as a prompt 
and will allow you to enter a response at the end of the same line, 
returning your response as a character vector. 


C[(WSID: INPUT] 
V R€CPROMPT PROMPT 
[1] aA Displays character vector PROMPT, allows 
[2] a keyboard input on same line and returns 
[3] aA character vector response. 
C4] O<-PROMPT 
[51 R€Ce,PROMPT) JD 
V 
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Note the use of ravel (,) on the 2nd line to handle the case in which 
PROMPT is a scalar (e.g. R€ECPROMPT '?'’). 


Given this utility function, the solution to the problem of the last 
section may be rewritten: 


V ASKUSER 
[1] NAME¢CPROMPT 'ENTER EMPLOYEE NAME: ' 
[2] SALARY¢€@CPROMPT '/ENTER SALARY: ' 

Vv 


This function is an improvement over the previous solution. However, 
problems remain. If you accidentally type a non-numeric character 
(e.g. 3B5)} in response to the '/ENTER SALARY:’ prompt, an APL error 
message will appear (from ¢) and the function will suspend. This is 
not user-friendly behavior. 


We could use 0 input C"evaluated input") to avoid the suspension. 
When an error occurs in evaluated input mode, the error message is 
displayed and you are reprompted. For example: 


ENTER SALARY 


D: 

3B5 
VALUE ERROR 
0 3B5 

A 
O: 


However, evaluated input mode has several disadvantages. The first 
is that its appearance (0:) is odd to a naive user. The second is 
that the error messages are technical APL messages (e.g. SYNTAX 
ERROR), not user-oriented messages (e.g. DEPARTMENT NUMBER MUST BE 
NUMERIC). The third is that you may inadvertently invoke other 
functions in the workspace. The fourth is that the response will not 
be accepted on the same line as the prompt. 


Therefore, we will stick with 0 input ("character input"). We will 
define a function NINPUT (numeric input) which will display its 
character vector prompt right argument and will allow you to enter a 
response at the end of the same line. The response will be converted 
to numbers if possible and returned. If not possible, the message 
'x*x ENTER NUMBERS ONLY **’' will be displayed and you will be 
reprompted. 


How then do we convert a character vector which looks like numbers 
(e.g. '67 15') into a numeric vector (e.g. 67 15)? A number of APL 
implementations (e.g. APL*PLUS, SHARP APL) have available the 
companion monadic functions OFI (fix input or format inverse) and OVI 
(verify input). The OFI function is just what we are looking for. 

It converts its character vector or scalar right argument into a 
numeric vector result. Each group of contiguous nonblank characters 
is converted into a single numeric element. If the group of 
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characters does not represent a valid number, it is returned as 0. 
For example: 


OFT '67.5 3B6 5 O HI’ 
67.5 05 0 0 


Since invalid groups are returned as 0, another function is required 
to tell the good Os from the bad Os. This is what OVI does. The OVI 
function converts its character vector or scalar right argument into 
a Boolean vector result. Each group of contiguous nonblank 
Characters is converted into a single bit: 1 if a valid number, 0 if 
not. For example: 


OVI '67.5 3B6 5 O HI’ 
10dai1 0 


The OFI and OVI functions always return vectors, never scalars. For 
example: 


eNFI '6’ 
1 

eOFI ' : 
0 


Using OFI and OVI, the definition of NINPUT is: 


CWSID: INPUT] 
V R¢NINPUT PROMPT 
[1] a Displays character vector PROMPT, allows 
[2] aA keyboard input on same line and returns 
[3] A numeric vector response. Requires CPROMPT. 
C4] Li:R¢CPROMPT PROMPT 
[5] %CA/OVI R)eL2 
C6] O<c'*x* ENTER NUMBERS ONLY **!’ 
C7] 9L1 
[8] L2:R¢OFI R 
V 


If OFI and OVI are unavailable in your implementation of APL, you 
must write an APL function which performs a similar function. Such a 
"parsing" function is quite complicated and is not included here. 


In APL2, which does not have OFI and OVI, exception handling may be 
used to execute the character vector and to display appropriate 
messages if it can not be successfully executed. The system function 
NFA Cexecute alternate) is used. It executes its right argument and 
returns the result. If an error occurs while executing its right 
argument, its left argument is executed. 
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In APL2, the definition of NINPUT is: 


[WSID: INPUT] 
V RENINPUT2 PROMPT 
[1] a Displays character vector PROMPT, allows 
[2] aA keyboard input on same line and returns 
[3] A numeric vector response. Requires CPROMPT. 
C41 L1:R¢CPROMPT PROMPT 
[5] aA Return empty numeric vector if all-blank response: 
[6]  %CRV.#' 'JoL2 
C7] R¢10 
[8] >0 
[9] aA Allow only characters which may be parts of numbers 
[10] a (so that other functions will not be executed): 
[11] L2:2CA/RE'0123456789.7E ')LL3 
[12] A Make sure any E Cexponential notation) is not 
[13] A preceded by a blank: 
[14]  >C(v/' E’e’ ',R)eL3 
[15] A Ravel when assigning to insure a vector result: 
[16] '9L3' OEA 'Re,',R 


[17] 30 
[18] L3:0¢'** ENTER NUMBERS ONLY «x! 
£19] ?L1 

V 
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PROBLEM: Write a function ASKUSER which prompts for employee name, 
salary and project codes (numeric) and assigns the 
respective responses to NAME, SALARY and CODES. Verify 
that the name is not all-blank, that the salary is a 
positive integer less than 100,000 and that no more than 5 
project codes are entered. Terminate ASKUSER immediately 
if the user types END in response to any question. The 
explicit result of ASKUSER is 0 if END is typed and is 1 
otherwise. 


TOPIC: Utility Validation Functions 


Let us take a reverse-engineering approach to this problem. We will 
write the ASKUSER function, employing imaginary utility functions as 
needed. Then, we will write the utility functions. Here is the 
finished solution: 
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V R¢ASKUSER 


C1] R¢0O 
[2] Li: 90 ESCAPE NAME<CPROMPTE '/ENTER EMPLOYEE NAME: ' 
[3] 9L1 IF (CNAMEA.=' ') MESSAGE '** YOU MUST ENTER A NAME’ 


[4] L2:90 ESCAPE SALARY¢<1 NPROMPTE '/ENTER SALARY: ' 

C5] 9L2 IF (CSALARY#fSALARY)VSALARY<0) MESSAGE '** SALARY 
MUST BE A POSITIVE INTEGER’ 

C6] 9L2 IF (CSALARY>100000) MESSAGE '** SALARY IS 
EXCESSIVE’ 

[71 L3:90 ESCAPE CODES¢0 NPROMPTE '/ENTER PROJECT CODES: ' 

[8] 9L3 IF (€5<eCODES) MESSAGE '** TOO MANY PROJECTS’ 

[9] Rel 
Vv 


This function was easy to write and is easy to read. Comments are 
unnecessary. Our task now is to write the utility functions such 
that the function is "user-friendly" as well. 


The CPROMPTE (character prompt with escape) function behaves like the 
CPROMPT function written earlier with one exception. It checks for 
the "escape" word END and returns the numeric scalar 1 if entered. 
Otherwise, it returns the character vector entered via the keyboard. 
The CPROMPTE function is listed below. 


Since some applications permit the use of more than one escape word 
(e.g. END, QUIT, BACKUP, ABORT, HELP, PRINT, etc.), we have written 
CPROMPTE to illustrate the use of several escape words, specifically 
END, BACKUP, and ABORT. CPROMPTE returns the scalar 1, 2 or 3 if the 
respective escape word is entered. 
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fWSID: INPUT] 
V R¢CPROMPTE PROMPT;S 
Displays character vector PROMPT, allows 
keyboard input on the same line and returns 
character vector response. Checks for entry 
of escape words END, BACKUP or ABORT and 
returns corresponding numeric scalar 1, 2 or 3 
if even partially entered. (Modify to include 
C7] your own set of escape words or to use exact 
C8] matching.) Requires: CPROMPT. 
[9] R¢CPROMPT PROMPT 
[10] A Exit if empty entry: 
[11] »€xS¢eR)10 
[12] aA Branch unless 'END’ partially entered: 
[13]  %C(RV.#ST'END’ JoL1 
[14] aA For exact (not partial) match: 
[15] A >((3#S)V'END'V.#3TR)oL1 
[16] aA Or, if = is available: 
A 
A 


C1] 
[2] 
C3] 
C4] 
[5] 
C6] 


D>DdDDIDIODO 


C17] >(C'’END’ =R)LL1 
[18] Else return scalar 1 (in origin 1): 
[19] R¢OQIO 
[20] 0 
[21] A Return 2 if 'BACKUP’ entered: 
[22] Li: >%C(RV.#ST' BACKUP’ )oeL2 
[23] R¢1+010 
[24] 30 
[25] a Return 3 if ‘ABORT’ entered: 
[26] L2:>(RV.#ST' ABORT’ )00 
[27] R¢e2+0I0 
V 


The ESCAPE function is a dyadic function which checks to see if its 
right argument is a scalar. If so, it returns its label left 
argument. If not, it returns an empty vector so that no branch will 
take place. The ESCAPE function is listed below. 


The ESCAPE function has been written to accomodate multiple escape 
words. For example, the expression, 


9(L99,L1,0) ESCAPE NAME¢CPROMPTE 'ENTER EMPLOYEE NAME: ' 
will cause a branch to one of the "labels" L99, L1 or 0 if the 
corresponding escape word END, BACKUP or ABORT is entered. If a 


single label is provided as ESCAPE’s left argument, that label is 
returned if any of the escape words is entered. 


-124- 


Chapter 8 WRITING USER-FRIENDLY INTERACTIVE FUNCTIONS 


CWSID: INPUT] 
V R¢LABELS ESCAPE CODE 
Used as: 


9(L1,L2,0) ESCAPE NAME¢CPROMPTE 'ENTER NAME: ' 


Returns LABELS(CODE] if code is a scalar. 
Otherwise, returns 10 so no branch occurs. 
If LABELS is a singleton, it is returned for 
any scalar CODE. 
Return empty vector for non-scalar CODE: 
[10] R¢€v0 
C11] > €xeeCODE)e0 
[12] A Return LABELS for singleton LABELS: 
[13] R¢LABELS 
[14] %(1A.=eLABELS) 00 
[15] A Otherwise, return label for corresp escape code: 
C16] R¢LABELS[ CODE] 

V 


a | 

1 

eed 
>DDDDDIDIODND 


The IF function is the standard conditional branching function. 


([WSID: INPUT] 


V REL IF C 
[1] aA Conditional branch function. Used as: 
[2] A LABEL IF I>50 
C3] R€¢C/L 

Vv 


The MESSAGE function is a dyadic function which returns its Boolean 
left argument and which displays its character vector right argument 
only if the left argument is 1. 


C(WSID: INPUT] 
V R¢BIT MESSAGE CVEC 
C1] a Displays err msg CVEC if BIT=1. Used as: 


9ASK IF (X<0) MESSAGE '/VALUE IS NEGATIVE’ 


rm 
ee) 
wi 
oD D 


[5] R¢BIT 

[6] >BITLO 

C7] OU<CVEC 
V 


The NPROMPTE (numeric prompt with escape) function is dyadic. Its 
left argument is the number of numbers required. Since a check for 
the exact number of numbers entered (usually 1) is the most common 
numeric input check, we will build the check into the function. If 
the left argument is 0, we will accept any number of numbers. Since 
the result of NPROMPTE is passed as the right argument of ESCAPE, we 
Will use CPROMPTE within NPROMPTE and will return a numeric scalar 


=125>= 


Chapter 8 WRITING USER-FRIENDLY INTERACTIVE FUNCTIONS 


Cescape) result directly instead of the normal numeric vector 
response. 


The NPROMPTE function for APL*PLUS or SHARP APL follows: 


CWSID: INPUT) 
VY R¢ENUM NPROMPTE PROMPT 
Displays character vector PROMPT, allows 
keyboard input on same line and returns 
numeric vector response of length NUM 
Cor of any length if NUM=0). Returns 
numeric scalar escape code if escape word 
entered. Requires: CPROMPTE. 
C7] L1i:R¢CPROMPTE PROMPT 
[8] aA Exit if scalar escape code: 
[9] >CoeRILO 
[10] %CA/OVI R)/L2 
[11] O¢«'*x* ENTER NUMBERS ONLY x«x! 
C12) >L1 
[13] L2:R¢€OFI R 
[14] a Exit if NUM is 0 or is length of input: 
[15] -»NUMILO 
[16] 3C(NUM=eR)/0 
C17] Oe'*k* ENTER ',CSNUM),’ NUMBER’ ,CNUM=1)/’'S «x! 
C18] Li 


C1] 
[2] 
ie 


DDDDOOD OD 


~126- 


Chapter 8 


C1] 
[2] 
C3] 
C4] 
[5] 
C6] 
C7] 
C8] 
[9] 
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The NPROMPTE function for APL2 follows. 


CWSID: INPUT] 


V R¢NUM NPROMPTE2 PROMPT 


Displays character vector PROMPT, allows 
keyboard input on same line and returns 
numeric vector response of length NUM 

(or of any length if NUM=0). Returns 
numeric scalar escape code if escape word 
entered. Requires: CPROMPTE. 
L1:R€¢CPROMPTE PROMPT 

A Exit if scalar escape code: 

>CoeR)LO 


0 DDDND 


2D 


[10] A Return empty numeric vector if all-blank response: 
[11]  >C(Rv.#' ')JoL2 
C12] R€to 
C13] 74 
[14] aA Allow only characters which may be parts of numbers 
[15] a (so that other functions will not be executed): 
[16] L2:3CA/RE’0123456789. E ')IL3 
[17] A Make sure any E (exponential notation) is not 
[18] A preceded by a blank: 
[19]  %(v/' E'e’ ',RIEL3 
[20] A Ravel when assigning to insure a vector result: 
[21] '29L3' UOEA 'R¢e,’,R 
[22] ?L4 
C23] L3:O0¢’** ENTER NUMBERS ONLY «x! 
C24] -L1 
C25] A Exit if NUM is O or is length of input: 
[26] L4:%»NUMLO 
[27] > (NUM=oR)/0 
C28] O¢e'*x* ENTER ',C6NUM),' NUMBER’ ,CNUM=1)1'S x*x’ 
[29] »L1 
V 
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PROBLEMS : CSolutions on pages 362 to 363) 


1. Write a function LPROMPTE (letter prompt with escape) which 
prompts for a single letter. The right argument is the character 
vector prompt. The left argument is a character vector of 
allowable single characters which the user may enter. The result 
is a one element vector index into the left argument of the 
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character entered or is the numeric scalar escape code if an 
escape word is typed. To illustrate: 


[10] »~0 ESCAPE ACTION€’ACD' LPROMPTE 'ADD, CHANGE, DELETE: ' 
[11] »CADD, CHANGE, DELETE) CACTION] 


2. Suppose you are writing interactive functions for a user who does 
not have an APL terminal. Without an APL terminal, the user 
cannot enter a negative symbol (7). What modifications would you 
make to the utility functions described in this chapter to allow 
the user to enter a minus symbol (-) for negative numbers (e.g. 
-38)? 


3. Write a niladic function PROPOSAL which generates a proposal for 
life insurance as follows: 


PROPOSAL 
NAME: Fred 
NUMBER OF KIDS: 3 


AGES OF KIDS: 3 4 8 
PRESS ENTER WHEN READY... (press ENTER key) 


Dear Fred: 


As a proud parent of 3 kids (whose 
average age is 5), you need insurance. 


(press ENTER key) 
GENERATE ANOTHER PROPOSAL? N 


Use the utility functions developed in this chapter. 
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MANIPULATING DATES 


In the field of data processing, one of the more commonly 
processed forms of data is dates. Dates tell us when employees were 
hired, when bonds mature, when insurance policies take effect, when 
commissions are due, when materials must be reordered, when expenses 
are incurred, and so on. Despite the many uses of dates, the number 
of different tasks performed on dates is small. This chapter 
discusses those tasks: representing dates in APL, entering dates, 
displaying dates and manipulating dates. 
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PROBLEM: Given that APL supports only two datatypes (numbers and 
characters), how should dates be represented? 


TOPIC: Representation of Dates in APL 


Suppose you wish to keep track of the date March 22, 1986. What are 
the different possible conventions you might employ to store this 
date? Here are several: 


1. DATE*¢'MARCH 22, 1986' C'MONTH DD, YYYY’) 

2. DATE¢3221986 CMMDDYYYY) 

3. DATE©€19860322 CYYYYMMDD) 

4. DATE¢860322 CYYMMDD) 

5. DATE¢1986 3 22 CYYYY MM DD) 

6. DATE¢1986 81 CYYYY DDD, days from December 31 
of the previous year) 

7. DATE€31127 (Days from December 31, 1899) 


In order to choose the best convention, you must consider the ways in 
which the date is to be used. Different representations are better 
for different applications. 
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For example, the first representation (DATE¢'MARCH 22, 1986') is 
ideal if all you want to do with the date is display it. However, 
the representation requires 14 bytes (characters) of storage which is 
more than any of the other representations and it does not lend 
itself to chronological sorting or to date arithmetic (say, adding 3 
months to it). 


The second representation (DATE¢3221986) requires less storage (4 or 
8 bytes depending upon the APL implementation) and is still fairly 
easy to display in a meaningful form (say 3/22/1986). However, it 
also requires transformation before it can be sorted or used in date 
arithmetic (since 3221986 is greater than 1221987 but 3/22/1986 
occurs earlier than 1/22/1987). 


The third representation (DATE¢19860322) requires the same storage as 
the prior representation and is fairly easy to display ina 
meaningful form (say 1986/03/22) and can be sorted with or compared 
to other dates directly, without transformation. For example, since 
19870122 is greater than 19860322, it occurs later. 


The fourth representation (DATE¢860322) is similar to the prior 
representation. Its advantage is that it displays in two fewer 
character positions (say 86/03/22 vs. 1986/03/22}, though the storage 
requirements are the same. Its disadvantage is that the year is 
ambiguous and may not sort properly when comparing to dates in the 
next century (Ce.g. 15/03/22 for 2015/03/22). 


The fifth representation (DATE¢1986 3 22) has different storage 
requirements than the prior three representations (more or less 
depending upon the APL implementation). It is easier to work with 
for some manipulations (say, year arithmetic) but harder for others 
(say, comparing dates to see which is later). 


The sixth representation (DATE¢1986 81) makes day arithmetic easier 
but meaningful display harder. For example, it is simple to see that 
the date 50 days beyond 1986 81 is 1986 131. However, it is not as 
Simple to see that 1986 131 is May 11, 1986. 


The seventh representation (DATE¢31127) makes day arithmetic, date 
comparisons and chronological sorting trivial operations. However, 
converting the date to a meaningful form (year, month, day) is a 
complex task. 


Depending upon the specific requirements of your application, you may 
decide to pick any one of these or another form of date 
representation. If you are undecided, representations 3 
(DATE¢19860322) and 5 (DATE¢1986 3 22) are good choices. These 
representations seem to provide a nice balance between the extreme 
forms 1 and 7. 
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PROBLEM: Write a monadic function IPDATEMDY Cinput date in month, 
day, year order) which may be used to convert a date, as 
entered, into the internal representation of the date. The 
right argument of IPDATEMDY is the character vector 
representation of the date in month, day, year order and 
the result is the numeric scalar representation (YYYYMMDD) 
of the date. For example: DATE¢IPDATEMDY CPROMPT 'ENTER 
DATE OF HIRE: ' (where CPROMPT returns the character vector 
response to the prompt provided as its right argument). 

The result is 0 if the right argument does not represent a 
valid date. 


TOPIC: Entering and Validating Dates 


To be as friendly as possible, the function must allow you to enter 
the date in any reasonable form. For example, if the date being 
entered is March 22, 1986, the right argument (i.e. your response) 
may be in any of the following forms: 


3/22/86 

3.22.1986 

3 22 1986 

3 22 Cif the current year is 1986) 

3-22 

3-22-86 

and so on 
To be as safe as possible, the function must verify that the date 
entered is a valid date. For example, some dates which should be 
rejected are 13/25/86 and 2-29-86. 


The function follows: 
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[WSID: DATES] 


V YYYYMMDD¢IPDATEMDY CVEC;DD;MM;NVEC;YY;H010 


A Converts the character vector representation 
A of a date (e.g. '6/15' or '3-22-1986' or 
Aa '3 22' or '3.22.86') to an integer scalar 
A representation (CYYYYMMDD) of the date. 
a The items in the right argument are in month, 
Aa day, year order. The result is 0 if the 
a date is invalid. 
A 
O1T0¢1 


A Ravel CVEC in case a scalar; Jae 
A by space: 

CVECe , CVEC 

CVECE (CCVECE'/-.')/LECVECI€'’ ' 

a Set result to 0 and exit if date not valid: 
YYYYMMDD¢€0O 

A Date must contain only digits and spaces 
A Conce / and - are converted): 

X9CA/CVECE’ 0123456789’ )L0 

A Convert character vector to numeric vector: 
NVEC¢e, ¢CVEC 

A Date must have 2 or 3 elements (MM DD or 
A MM DD YY): 

*(COoNVECIE 2 3)10 

A Stick on current year if omitted: 
>*(3=eNVEC)eL1 

NVEC¢ENVEC,1e0TS 

A Convert YY to YYYY using current century: 
L1: YY¢NVECLE3] 

>*CYY>99)pL2 

YY¢YY+100xLOTS[£11+100 

A Validate month: 

L2:MM¢NVEC[1] 


replace 


>(MMEL12)10 

A Validate day of month: 

DD¢ENVECL 2] 

9(€CDD<1) VDD>(31 29 31 30 31 30 31 31 30 31 30 
e0 


A Check 2/29 if a leap year: 

+( C(MM#2) VDD#29)eL3 

A Leap year every 4 years except at centuries 
A Cexcept 4th centuries): 
>CCOF4IYYIVCO=LOOIYYIAO#4Z001YY) 00 

A Pack date into YYYYMMDD format: 

L3: YYYYMMDD<1001YY,MM,DD 


Pu Bs es Pur A A Po A Pu Pe A Pus A Bs A A se A Be Os PO Pe AA 


<132= 


Chapter 9 MANIPULATING DATES 


PROBLEM: Given the three element vector DATE which represents a 
single date (in YYYY MM DD format), write the APL 
expressions which will generate each of the following date 
formats (for DATE€1986 3 22): 

a. MAR 22, 1986 


b. March 22, 1986 
c. 3/22/86 


TOPIC: Formatting Dates for Output 


In the first format (MAR 22, 1986), we must convert the month from an 
integer (e.g. 3) to a 3 element character vector (e.g. 'MAR’). The 
most direct way to do this is to first construct a 3 column character 
matrix which has one row per possible month: 

MON¢€12 30'’JANFEBMARAPRMAYJUNJULAUGSEPOCTNOVDEC' 


Then we may use the month number as the row index into MON. Here is 
one approach (showing 1986 3 5 as MAR 05, 1986): 


MONT DATE(L21]53;],' ',€ 2T'0’ ,¢DATE[3]),’, ',¢DATEL1] 


This approach does not work for a matrix (3 columns) of dates. It 
must be modified, as in the following: 


DAY¢2 OSDATEL ;3] 

DAY[(’ '=DAY)/ipDAYI]¢'0! 

DAY¢( (1TeDATE) ,2) epDAY 

MONEDATEL;21];],’ ',DAY,’,’,5 OSDATEL;:,1] 


If your implementation of APL supports the system function OFMT, you 
may construct this date format with the following: 


One date: 
MONE DATE[T2131,,'XK1,212,<,>,15' OFMT (CDATE(3];DATE[1]) 
Matrix of dates: 
'3A1,X%1,Z212,<,>,15’' OFMT (CMON(DATE( :21;:1;DATE[;3 1]) 
In APL2, you may do the following: 
One date: 
MONCDATE[L2]3;],' 05, 5555’'sDATEL[3 1] 
Matrix of dates: 


MONT DATEL $321]3;1,' 05, 5555'stDATEL;3 1) 
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In the second format (MARCH 22, 1986), the entire month name is 
displayed. This format differs from the first in that the length of 
the formatted result is not fixed but depends upon the length of the 
month name and upon the number of digits in the day. 


One approach to selecting the month name portion of the date is to 
build a character matrix of month names (padded to the right with 
blanks), extract the appropriate row and squeeze out the blanks: 


MONTH€12 9oe’'’JANUARY FEBRUARY MARCH APRIL MAY...’ 
MON¢MONTHLE DATEL 2]; ] 
MON¢€(MON#! °')/MON 


A second approach is to build a character vector of month names and a 
corresponding vector of the lengths of the names. The vector of 
lengths may be used to locate the corresponding name: 


MONTH¢ ' JANUARY FEBRUARYMARCHAPRILMAYJUNEJULYAUGUST...’ 
MLEN¢ 785534469 7 8 8 
MON¢MONTHE €0,+\MLEN) CDATEL2]1+UMLENCDATEC2]1] 


Using either approach, once the name is selected, the formatting of 
the date is simple: 


MON,’ ',(é®DATE(31]),’', ',¢DATEL1] 


Generally, the second format (MARCH 22, 1986) is not used when 
formatting many dates at once since they will have a ragged 
appearance: 


MARCH 22, 1986 

MAY 3, 1986 
SEPTEMBER 17, 1987 
JULY 4, 1988 


On the other hand, you may choose to align the days and years: 


MARCH 22, 1986 
MAY 03, 1986 
SEPTEMBER 17, 1987 
JULY 04, 1988 


To construct this result, you can simply use the same approaches 
discussed for the first format (MAR 22, 1986), but create a 9 column 
character matrix of right-justified month names instead of a 3 column 
character matrix of abbreviated month names: 

MON¢12 90’ JANUARY FEBRUARY MARCH APRIL MAY...!' 
In the third format (3/22/86), the month number does not need to be 


translated into a month name. However, the first two digits of the 
year must be truncated. Here is one approach: 


CS$DATEL21),'/',C 2T'0O' ,séDATE[3]),'/', 2TSDATEL1] 
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This approach does not work for a matrix (3 columns) of dates. It 
must be modified, as in the following: 


C’/',4 O&FDATEL;,1],DATEC331]+100xDATE[L321])(03;6 71891 4 5] 
If your implementation of APL supports OFMT, you may do the following: 
One date: 
»'12,</>,Z212,</>,Z12' OFMT CDATE(L2];DATEL31]1;1001DATE[11]) 


or 
»'G<Z9/99/99>' OFMT 1OOLDATEL2 3],100IDATEL1] 


Matrix of dates: 
'T2,</>,2Z212,</>,2Z12' UOFMT CDATELC:2 31;100!IDATEL;:1]) 


or 
'G<Z9/99/99>' OFMT C1001DATEL 3$11])+100xXDATEL 333+100XDATEL 32] 


In APL2, you may do the following: 
One date: 
'56/06/05' 6DATEL2 3 1] 
Matrix of dates: 


'56/06/05'sDATEL 532 3 1] 
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PROBLEM: Since some tasks involving dates are more easily solved 
from certain date representations than from others, design 
and write a set of date conversion functions which can be 
used to convert between the various date representations. 


TOPIC: Manipulating Dates 


Suppose you limit yourself to 4 different date representations. You 
will need 24 different date conversion functions to handle every 
possible conversion (4 types times 3 remaining types times 2 
directions: to or from). If you handle 5 different date 
representations, you will need 40 different functions. 


You can reduce the number of functions needed by assuming a "base" 
date representation. If you define just enough functions to be able 
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to convert any date representation to or from the base 
representation, you can do any possible conversion, though it may 
require two steps instead of one. For example, to convert dates from 
a first representation to a second, neither of which is the base 
representation, you can convert the dates from the first 
representation to the base representation and then from the base 
representation to the second representation. 


Let us use YYYYMMDD (e.g. DATE¢19860322) as the base representation. 
We choose this representation because it can serve a number of 
functions directly, without conversions: 


a. Such dates can be sorted chronologically using grade-up (4A) or 
grade-down (¥). 


b. Such dates can be compared chronologically (before or after) 
using the relational functions (=, #, >, <, 2, =). 


c. Such dates can be displayed directly (or with minor formatting) 
and be readily interpreted by the reader. 


If you prefer a different base date representation for your 
applications, you may modify the functions below to suit your needs. 


Assuming the YYYYMMDD base date representation, we propose the 
following date utility functions: 


1. MMDDYYYY¢TOMDY YYYYMMDD 
Converts from YYYYMMDD format to MMDDYYYY format. For example: 
TOMDY 19860322 19870209 
3221986 2091987 
2. YYYYMMDD¢FROMMDY MMDDYYYY 
Converts from MMDDYYYY format to YYYYMMDD format. For example: 
FROMMDY 3221986 2091987 
19860322 19870209 
3. DAYS¢TODAYS YYYYMMDD 


Converts from YYYYMMDD format to number of days since 
February 29, 0000. For example: 


TODAYS 19860322 19870209 
725393 725717 
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4. YYYYMMDD¢FROMDAYS DAYS 


Converts from numbers of days since February 29, 0000 to 
YYYYMMDD format. For example: 


FROMDAYS 725393 725717 
19860322 19870209 
5. OTS¢TOOTS YYYYMMDD 
Converts from YYYYMMDD format to 3T0TS format (i.e. YYYY MM DD). 
The shape of the result is the catenation of 3 and the shape of 


the right argument. For example: 


TOOTS 19860322 19870209 


1986 1987 
3 2 
22 9 


6. YYYYMMDD¢FROMOTS QTS 


Converts from 3TOTS format (i.e. YYYY MM DD) to YYYYMMDD format. 
The first element of the shape of the right argument must be 3. 
The shape of the result is all but the first element of the 
shape of the right argument. For example: 


FROMOTS 3 2 e 1986 1987 3 2 22 9 
19860322 19870209 


7. DAYS360€TODAYS360 YYYYMMDD 


Converts from YYYYMMDD format to number of days since 

January 1, 0000, assuming a 30 days per month, 12 months per 
year, 360 days per year calendar (the 3lst day of the month is 
treated like the 30th day). Financial institutions frequently 
assume 360 days per year. For example: 


TODAYS360 19860322 19870209 
715041 715358 
8. YYYYMMDD<¢FROMDAYS360 DAYS360 
Converts from number of days since January 1, 0000 to YYYYMMDD 
format, assuming a 30 days per month, 12 months per year, 360 


days per year calendar. For example: 


FROMDAYS360 715041 715358 
19860322 19870209 
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February 29, 0000 was chosen as a base date in the TODAYS and 
FROMDAYS functions for computational reasons. At the end of that 
leap day, a 400 year cycle of leap years began. The conversion from 
dates to days or vice versa is easier when March 1 is considered the 
first day of the year. When a leap year occurs, the leap day is the 
last day of the year. 


Let us illustrate the application of these functions by using them to 
solve a variety of problems. The following problems assume that the 


variable DATES is a vector of dates whose values are assigned in the 
YYYYMMDD format (Ce.g. DATES€19860322 19870209 19851225...). 


A. How many dates occur in 1987? 
+/ (DATES219870101) ADATES<19871231 (no conversion needed) 
B. Display in MM/DD/YYYY format the dates derived by adding 30 
years to each date. 
'G<Z9/99/9999>' DOFMT TOMDY 300000+DATES Cassuming OFMT) 
'55/55/5555'sTOMDY 30000+DATES (assuming APL2) 
C. What dates result when adding 90 days to each date? 


FROMDAYS 90+TODAYS DATES 


D. Which dates occur in any September? 
C9=(TOQOTS DATES) (2;1)/DATES 
E. Assuming a 360 day year (as in bond calculations), how many whole 
6 month periods (i.e. semiannual coupons) are there from each date 
to the date July 4, 1995? 
L¢CTODAYS360 19950704)-TODAYS360 DATES)+180 
F. Display Cin YYYY/MM/DD format) the dates in the past (before 
today’s date), in reverse chronological order (present to past). 
D¢CDATES<FROMQOTS 3T0TS)/DATES 


'G6<9999/99/99>' DOFMT DI[YD] Cassuming OFMT) 
'555/55/55'sDL YD] Cassuming APL2) 
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G. Compute the ages (age last birthday) today of people born on each 
of the dates. 


TODAY<¢3 TOTS 
YMD¢TOQOTS DATES 
CTODAYC1J-YMDE£1;1)-CLOOLTODAY£L2 3)3])<1001YMD[2 33] 


The definitions of these date utility functions follow. In those 
instances for which two substantially different algorithms are 
available to perform the same task, two functions have been provided, 
one with the name suggested above and the second with the same name 
followed by ‘A’ (e.g. FROMDAYS and FROMDAYSA). You may want to time 
the alternate functions for your APL installation to determine which 
is faster (see the Computer Efficiency Considerations chapter). 


[WSID: DATES] 
V MMDDYYYY<TOMDY YYYYMMDD 
[1] aA Converts dates in form YYYYMMDD to form 
C2] aA MMDDYYYY by numerical manipulations. 
[3] a The steps: 19860322 7% 322 9 
[4] A €32200000000-322) >» 32219860000 »* 3221986 
C5] A 
C6] MMDDYYYY¢l CYYYYMMDD+99999999x10000 I YYYYMMDD)+10000 
V 


CWSID: DATES] 

V MMDDYYYY<TOMDYA YYYYMMDD 
[1] aA Converts dates in form YYYYMMDD to form MMDDYYYY 
[2] aA by unpacking, rotating and re-packing the digits. 
[3] A The steps: 19860322 >» 1986 322 » 322 1986 » 3221986 
[4] aA 
[5] MMDDYYYY¢< 10000 1 +.xe O 10000 TYYYYMMDD 
[6] aA Alternative: 
[7] A MMDDYYYY¢ O 10000 16 O 10000 TYYYYMMDD 

Vv 


CWSID: DATES] 
VY YYYYMMDD<FROMMDY MMDDYYYY 
[1] aA Converts dates in form MMDDYYYY to form 
[2] A YYYYMMDD by numerical manipulations. 
[3] aA The steps: 3221986 > 1986 9 
[4] A €198600000000-1986) > 198603220000 >» 19860322 
[5] A 
C6] YYYYMMDD<¢L CMMDDYYYY+99999999x10000|MMDDYYYY)+10000 
V 
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C1] 
C2] 
[3] 
[4] 
[5] 
[6] 
C74 


on 
[2] 
C3] 
[4] 
[5] 
C6] 
[7] 
[8] 
[9] 
C10] 
C11] 
C12] 
[13] 
C14] 
[15] 
[16] 


C1] 
C2] 
[3] 
C4] 
[5] 
C6] 
[7] 
[8] 
[9] 
C10] 
C11] 
[12] 
C13] 
C14] 
[15] 
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[WSID: DATES] 
¥V YYYYMMDD<¢FROMMDYA MMDDYYYY 
A Converts dates in form MMDDYYYY to form YYYYMMDD 
A by unpacking, rotating and re-packing the digits. 
A The steps: 3221986 > 322 1986 >» 1986 322 > 19860322 
A 
YYYYMMDDe 10000 1 +.xe Q 10000 TMMDDYYYY 
Aa Alternative: 
A YYYYMMDD< O 10000 16 0 10000 TMMDDYYYY 
V 


C[WSID: DATES] 
V DAYS¢TODAYS YYYYMMDD;DD;YYYYMM;MM:YYYY;010 
A Converts date (YYYYMMDD) to number of days since 
A Feb. 29, OOOO. 
OLO¢+1 
DD¢€100!1YYYYMMDD 
YYYYMM¢ (CYYYYMMDD-DD)+100 
MM¢1001YYYYMM 
YYYY¢CYYYYMM-MM)+100 
A Treat Jan and Feb as if in prior year (to have 
A leap day at end of yr) 


YYYY¢YYYY-MM<2 
A Days from Feb. 29, 0000 to prior Feb. 28/29 (leap 
A year every 4th year, no leap year every 100th year, 
A leap year every 400th year): 
DAYS€(C365XYYYY)+-/LYYYYo.+ 4 100 400 
A Add in DD days and days from prior Feb. 28/29 
DAYS¢DAYS+DD+(306 337 0 31 61 92 122 153 184 214 245 
275) CMM] 

V 


[WSID: DATES] 

VY DAYS¢TODAYSA YYYYMMDD;DD;MM; YYYY:YYYYMM;O01I0 
A Converts date CYYYYMMDD) to no. of days 

A since Feb. 29, OOOO. 

DD¢€1001YYYYMMDD 

YYYYMM¢ CYYYYMMDD-DD)+100 

MM<1001YYYYMM 

YYYY¢CYYYYMM-MM)+100 

A Treat Jan and Feb as if in prior year (to 
A have leap day at end of year): 

YYYY<YYYY-MM<2 


A Days from Feb. 29, 0000 to prior Feb. 28/29 

A €146097, 36524, 1461, 365 days in 400, 100, 

A 4, 1 year cycles): 
DAYS€ 146097 36524 1461 365 +.x 0 4 25 4 TYYYY 

A Add in DD days and days from prior Feb. 28/29: 
DAYS¢DAYS+DD+(€306 337 0 31 61 92 122 153 184 214 245 
275) CMM] 

Vv 
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Ci] 
[2] 
C3] 
[4] 
C5] 
C6] 
C7] 
C8] 
[9] 
C10] 
C11] 
C12] 
C13] 
[14] 
[15] 
C16] 
C17] 
C18] 
C19] 
[20] 
C21] 
[22] 
[23] 
C24] 
C25] 
[26] 
[27] 
[28] 


[29] 
C30] 


C31] 
C32] 
C33] 
C34] 
C35] 


V 


Vv 
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CWSID: DATES] 
YYYYMMDD¢FROMDAYS DAYS;DD;IND;MM; PDAYS;RDAYS;SHAPE;Y; 
Y¥YYY;O1L0 


Aa Converts number of days since Feb. 29, 0000 
A to date CYYYYMMDD). 
OTo0¢+1 
A Work with array as a vector and reshape when done: 
SHAPE¢eDAYS 
DAYS¢€,DAYS 
A Approximate year Conly off for some 2/28, 
A 2/29 and 3/1 dates); 365.2425 is used because 
a there is a leap year every 4th year (+.25), 
A no leap year every 100th year (-.01), leap 
A year every 400th year (+.0025): 


YYYY¢LDAYS+365.2425 

A Number of days from Feb. 
A prior year: 
PDAYS€(C365XYYYY)+-/LYYYYo.+ 4 100 400 

A Number of days from start of year to specified date: 
RDAYS«€DAYS-PDAYS 


29, 0000 to Feb. 28/29 of 


A Branch unless year may be too small by 1 (e.g. 3/1): 
+( xp IND¢ (RDAYS2=366)/tleRDAYS)JL1 
YYYYCINDI¢Y¢YYYYCINDI4+1 
RDAYSCINDIJ©¢DAYSLCINDI-C€C365xY)+-/LY°.+ 4 100 400 

A Branch unless year too big by 1 (e.g. 2/29 looks 


A like 3/0): 

L1:>(xopIND€ (RDAYS<0)/tpRDAYS)/L2 
YYYYCINDI¢Y¢YYYYCINDI]+°1 
RDAYSCINDI¢DAYSCTIND1]-(365xY)+-/LY°.+ 4 100 400 

A Determine month no. from no. days from start of yr: 

L2:MM¢(31 30 31 30 31 31 30 31 30 31 31 29 /2%112)E 
RDAYS] 

A Determine day no. from no. days from start of mon.: 
DD¢RDAYS-(€306 337 0 31 61 92 122 153 184 214 245 275). 
MM] 

A Correct for fact that Jan. and Feb. are treated 

A as if in prior yr (to have leap day at end of yr): 
YYYY¢YYYY+MM<2 

A Repack and reshape result: 
YYYYMMDD¢SHAPEoeDD+(100XMM)+10000XYYYY 
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CWSID: DATES] 
V YYYYMMDD¢FROMDAYSA DAYS;L4;L400;MM;N1;N4;3;N100;N400; 
YYYY;QO10 
[1] a Converts number of days since Feb. 29, 000 
[2] aA to date (CYYYYMMDD). 
[3] HIO¢+1 
[4] aA Reduce no. days by 1 so day 0 is Mar. 1, 0000: 
C5] DAYS©¢DAYS+ 1 
[6] aA No. of 400 year cycles (146097 days) preceding 
[7] aA each date: 
{8 ] N400¢€L DAYS+146097 
[9] A No. days since last 400 year cycle: 
[10] DAYS¢DAYS-N400x146097 
[11] A Flag 400 year leap dates (e.g. Feb. 29, 1600) 
[12] A and change to Feb. 28: 
[13] L400¢€DAYS=146096 
[143] DAYS€DAYS-L400 
[151 A No. of 100 year cycles (36524 days) preceding 
[16] A each date: 
[17] N1l100¢LDAYS+36524 
C18] A No. days since last 100 year cycle: 
[19] DAYS¢DAYS-N100x36524 
[20] A No. of 4 year cycles (1461 days) preceding each 
[21] A date: 
(22] N4¢€¢LDAYS+1461 
[23] A No. days since last 4 year cycle: 
[24] DAYS¢DAYS-N4x1461 
[25] A Flag 4 year leap dates (e.g. Feb. 29, 1988) 
[26] A and change to Feb. 28: 
C27] L4¢DAYS=1460 
[28] DAYS<¢<DAYS-L4 
[29] A No. of 1 year cycles (365 days) preceding each 
[30] A date: 
[31] Nl¢LDAYS+365 
[32] A No. days since last 1 year cycle: 
[33] DAYS¢DAYS-N1x365 
[34] A Increase no. days by 1 so days are 1 to 365: 
[35] DAYS¢DAYS+1 
[36] A Determine month no. from no. days from start of yr: 
C37} MM¢C31 30 31 30 31 31 30 31 30 31 31 28 /24112) [DAYS] 
[38] A Determine day no. from no. days from start of mon.: 
[39] DAYS¢DAYS-(306 337 0 31 61 92 122 153 184 214 245 275) 
CMM] 

[40] A Add back in leap days: 
[41] DAYS©€DAYS+L4+L400 
[42] a Determine year from no.s of 400, 100, 4, 1 year 
[43] A cycles: 
[44] YYYY¢N1+(4xN4)+(€100xXN100)+400xN400 
C45] a Correct for fact that Jan. and Feb. are treated as 
[46] A if in prior year (to have leap day at end of yr): 
C47] YYYY¢YYYY+MM<2 
[48] A Pack year, month, day together: 
[49] YYYYMMDD¢DAYS+100xXMM+100xYYYY 

V 
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CWSID: DATES] 
V QTS€¢TOOTS YYYYMMDD 
C1] a Converts date (CYYYYMMDD) to OTS format (CYYYY MM DD). 
C2] QOTS¢€ O 100 100 TYYYYMMDD 
4 


[WSID: DATES] 
V YYYYMMDD¢FROMOTS QTS 
C1J] a Converts date from OTS format CYYYY MM DD) 
[2] aA to YYYYMMDD format. 
C3] YYYYMMDDe 10000 100 1 +.xQTS 
[4] aA Alternative: 
[5] aA YYYY< 0 100 100 1QOTS 
V 


[WSID: DATES] 
V DAYS¢TODAYS360 YYYYMMDD 
Converts dates in form YYYYMMDD to days since 
January 1, 0000 assuming a 30 days per month, 
12 months per year, 360 days per year calendar. 
The 31st day is treated like the 30th. 


C1) 
C2] 
C3] 
[4] 
[5] 
[6] Change DD=31 to DD=30 and subtract 1 from all days 
C7] and 1 from all months: 
C8] YYYYMMDD¢YYYYMMDD-101+31=100!IYYYYMMDD 
[9] DAYS€ 360 30 1 +.x 0 100 100 TYYYYMMDD 
[101] a Alternative: 
[11] aA DAYS€ 0 12 30 1 0 100 100 TYYYYMMDD 

Vv 


DDDDBDDD 


C[WSID: DATES] 

V YYYYMMDD<FROMDAYS360 DAYS 
C1] aA Converts days since December 30, “1 to dates in 
[2] aA form YYYYMMDD assuming a 30 days per month, 12 
C3] aA months per year, 360 days per year calendar. 
[4] aA 
[5] A Add 1 to all days and 1 to all months (101): 
C6] YYYYMMDD<101+ 10000 100 1 +.x O 12 30 TDAYS 
[7] a Alternative: 
C8] A YYYYMMDD¢<101+ O 100 100 1 O 12 30 TDAYS 

V 
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PROBLEMS: (Solutions on pages 364 to 365) 


1. 


Most bonds pay semi-annual coupons (interest payments). That is, 
every Six months the holder of the bond receives one coupon 
payment to compensate the holder for the use of his or her 

money. The bond has printed on it a maturity date, i.e. the date 
when the final semi-annual coupon is to be paid and when the face 
(par) value is to be repaid. When a bond is sold prior to 
maturity, the purchase date usually falls somewhere between two 
coupon dates. Since the seller and the buyer each hold the bond 
during a portion of the 6 month coupon period, each is entitled 
to a portion of the next coupon payment. Traditionally, the 
buyer pays the seller a portion (called the accrued interest) of 
the next coupon in addition to the agreed-upon purchase price. 
The number of days the bond was held by the seller is compared to 
the number of days the bond will be held by the buyer and the 
coupon is divided proportionately. The number of days is 
computed using a 360 day year (12 months of 30 days). 


Given two vectors PDATES and MDATES which respectively represent 
the purchase dates and maturity dates (in YYYYMMDD representation) 
of a set of bonds, determine the fractions of the coupons paid 
for accrued interest at purchase. 


Suppose you borrow $1000 and agree to pay .1% (.001) of the 
outstanding balance per day. If the variables BDATE and RDATE 
represent the dates (in YYYYMMDD format) on which you 
respectively borrow and repay the loan, how much interest do you 
pay? 


Dates stored in the YYYYDDD representation (e.g. 1986081 for 
March 22, 1986) are sometimes called "Julian" dates. The last 
three digits represent the number of days from the previous 
December 31. Write the utility function TOYD and FROMYD which 
may be used to convert dates in the YYYYMMDD representation to or 
from the Julian representation. 


What expressions will return the day of week today as a character 
vector (e.g. 'TUESDAY')? (Hint: Feb. 29, 000 was a Tuesday.) 
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WRITING REPORTS 


Report formatting in APL is an afterthought. It was an 
afterthought to those who designed and implemented APL. And it is 
frequently an afterthought to those who use APL. The APL language 
excels at manipulating large multi-dimensional arrays, not at 
inserting dollar signs and decimal points. The task of designing and 
implementing reports is slow and tedious and not relished by many 
programmers, APL or otherwise. 


Excellent report formatting capabilities have evolved in the various 
implementations of APL over the years. These have greatly improved 
the productivity of the APL programmer but probably not to the extent 
that report formatting is fun. The capability available in APL*PLUS 
and in SHARP APL is OFMT. In APL2 it is format by example (dyadic #6 
with character vector left argument). In unenhanced versions of APL, 
it is format (dyadic 6 with numeric vector left argument). 


In this chapter, we will describe techniques and utilities which can 
be employed to make report formatting easier and almost enjoyable. 


Bug Ne Au PA RN Pus Ps GS Pus Ps A PN Ba Ba Bs Pu Oe PR Pu PN A Ns Pua us ee Ps 


PROBLEM: How would you construct the following report? 


CAMPBELL CARPET CLEANING 
ANNDAL SUMMARY 


12/31/1986 
PERCENT 
ACCOUNT BUDGET ACTUAL DIFFERENCE 
GROSS REVENUES 650 625 ~3.8 
LESS DISCOUNTS 50 45 10.0 
NET REVENUES 600 580 ses 
EXPENSES 500 510 ~2.0 


NET INCOME 100 70 ~30.0 
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TOPIC: Viewing the Report 


Imagine transcribing this report onto a piece of graph paper (evenly 
spaced vertical and horizontal lines) by placing each letter, digit 
or other character into a single square. Viewed in this way, the 
report appears to be a simple character matrix. Your task is to 
construct the character matrix. 


At the simplest conceptual level, you need only use reshape (oe): 
REPORT€¢12 450’ CAMPBELL CARPET.... ~30.0 ' 


This is the most computationally direct and efficient way to 
construct the report. While the burden is light on the computer, 
however, it is enormous on you. You must count spaces and type them 
in precisely and you must type in the numbers whose values are 
probably already in the computer as vector or matrix variables. This 
is a tremendous waste of your time. 


It is more natural to view the report as a set of submatrices which 
can be pieced together to form the overall report. Then your task is 
to construct each piece and to catenate them together to construct 
the whole report. 


How do you subdivide the report? You should break it into as few 
pieces as possible where each piece may be constructed by a single 
straightforward procedure. Here is one possibility: 


CAMPBELL CARPET CLEANING 
ANNUAL SUMMARY 
12/31/1986 


PERCENT 
ACCOUNT BUDGET ACTUAL DIFFERENCE 


GROSS REVENUES 
LESS DISCOUNTS 

NET REVENUES 

EXPENSES 

NET INCOME 


By using formatting utility functions or APL primitive functions, you 

can construct each of these blocks with relative ease. Once 

constructed, they may be pieced together by a single statement: 
REPORT¢TOP, CL JMIDDLE, [C1] LEFT,RIGHT 


If you do not have access to utility functions or are using poorly 
designed functions, you will be forced to break the report into 
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smaller pieces and construct it in more steps. More steps means less 
productivity. 


Of the four blocks above, the one which may be constructed directly 
by using APL primitive functions is the matrix of numbers in the 
lower right of the report. Assuming a 3 column numeric matrix of 
values named DATA, you may construct the block by the following: 


RIGHT¢(7 0 8 011 1 DATA), CClEEDATA) ,2)0' ' 
Or: 
RIGHT€209 0 8 0 11 1¢DATA 


(Note that the 2 columns of trailing blanks could have been 
considered a fifth block of the report.) 


If your APL implementation has an enhanced formatting capability, you 
may choose to use it rather than 6. For example, with OFMT, you may 
use: 

RIGHT¢€'I7,I8,F11.1,X2' OFMT DATA 
In APL2, you may use: 


RIGHT<'5555550 5555550 5555550.00 ‘'#séDATA 
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PROBLEM: Design utility functions which will construct the three 
remaining blocks in the report above. 


TOPIC: Constructing Titles and Headings 


The top block is a block of titles. When you look at that block, you 
see the words and date, not the blanks. You see three strings of 
nonblank characters on three separate lines, centered within the 
width of the report. The information which completely specifies this 
block is: 


* the width of the report (45 characters) 


* the fact that each line is centered within the report width 
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* the nonblank strings to be used, one per line (4 lines): 


CAMPBELL CARPET CLEANING 
ANNUAL SUMMARY 
12/31/1986 

Cempty ) 


The left block is a block of row names. It is similar to the block 
of titles except the nonblank strings are left-justified within the 
width of the block (except for one indented line). The information 
which specifies this block is: 


* the width of the block (17 characters) 
x the fact that each line is left-justified within the block 
* the nonblank strings to be used, one per line (5 lines): 


GROSS REVENUES 
LESS DISCOUNTS 

NET REVENUES 

EXPENSES 

NET INCOME 


The specifications for these two blocks suggest the syntax for a 
formatting utility function which we will call TITLES. Let us 
illustrate the syntax before we define it: 


TOP¢+45 TITLES 'NCAMPBELL CARPET CLEANINGNANNUAL SUMMARY 
N12/31/1986n' 


LEFT€17 TITLES 'CGROSS REVENUESC LESS DISCOUNTS 
CNET REVENUESCEXPENSESCNET INCOME’ 


The left argument is an integer scalar of the width of the resulting 
character matrix. The right argument is a delimited character vector 
whose partitions each begin with one of the delimiters c (left- 
justify), n (center) or > Cright-justify). The result has one row 
per partition. Each partition is justified within the row according 
to the delimiter. 


The TITLES function is developed and explained in the Positioning 
Character Data chapter. 


The remaining (middle) block is a block of column headings. When you 
look at the block, you see four headings. The rightmost heading has 
two lines. The lines are centered with respect to each other. Every 
heading is underlined. Each set of underlines is separated from its 
neighbors by two spaces. The headings are centered with respect to 
the underlines. 


We will illustrate and then define the syntax for a formatting 
utility function which we will call HEADINGS. 


-148- 


Chapter 10 WRITING REPORTS 


MIDDLE¢17 6 6 10 HEADINGS 'nNACCOUNTNBUDGETNACTUAL 
NPERCENT¢DIFFERENCE’ 


The right argument is a delimited (by leading n symbols) character 
vector whose partitions each represent one heading. The partitions 
themselves may be delimited by newline delimiters (¢) which indicate 
the points at which headings are broken into multiple lines. The 
left argument is an integer vector of the widths of the fields into 
which the headings are inserted. Typically, one width is provided 
for each heading (partition). However, if fewer widths are provided, 
they are repeated to match the number of partitions. 


The subpartitions of each partition are truncated if necessary to the 
corresponding width for that heading. The subpartitions are centered 
above one another within the width for that heading. A row of 
underlines Chyphens) is placed below each heading across the width of 
the heading. The headings are separated by 2 blank columns. Ifa 
separation of more or less than 2 blank columns is desired, you may 
include a vector of the desired separations after the vector of 
widths in the left argument. They will be repeated if necessary to 
match the number of partitions. For example, to have one blank 
column between each heading in the example above: 


MIDDLE¢17 7 7 11 1 HEADINGS '/nACCOUNTNBUDGETNACTUAL...' 


The HEADINGS function is developed as a problem at the end of the 
Positioning Character Data chapter. 
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PROBLEM: Most primitive APL formatting capabilities are column 
oriented. That is, each column of a numeric matrix to be 
formatted is treated as a separate entity. Every row of 
the column is formatted in the same way as every other 
row. This is not true of every column. For example: 


606162 8 3 3 19 
1 2.0 3.00 
4 5.0 6.00 
7 8.0 9.00 


What approach would you take to do row oriented formatting. 
For example, how would you generate the following? 


1 2 3 


4.0 5.0 6.0 
7.00 8.00 9.00 
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TOPIC: Row Oriented Formatting 


A common approach to this problem is to simply format each row 
separately. For example: 


R¢3 180’ ' 

Rl1;1<€6 OSFMATL[1;] 
R£L2:]¢€6 1¢6MAT(2;3 ] 
RL3;1]¢€6 26MAT(L33; ] 


This process may be tedious if the matrix has many rows. The problem 
may be solved noniteratively by transposing the data, formatting it 
with a column oriented formatting function and then transposing it 
back. Both monadic and dyadic applications of transpose (&) are 
required. The specific logic required is presented as a problem at 
the end of the Positioning Character Data chapter. Using the ROWFMT 
function developed there, the solution is: 


width¢6 
R¢0 1 2 ROWFMT MAT 


PROBLEM: You wish to print 50 checks. You are given 5 global 
variables: CNUM (50 element integer vector of check numbers, 
e.g. 305); CDATE (50 element integer vector of check dates 
in MMDDYY format, e.g. 121586); VENDOR (50 row, 25 column 
character matrix of vendor names, e.g. 25T’'ACME SUPPLIES’): 
CAMT (50 element integer vector of check amounts in cents, 
e.g. 518250); DESC (50 row, 40 column character matrix of 
check descriptions, e.g. 40T’LOOSELEAF NOTEBOOKS’). Write 
a function to print the information on continuous form blank 
checks loaded in the printer. The following is an 
illustration of the layout and characters to be printed for 
a Single check: 


NO. 00305 DEC 15, 1986 


TO: ACME SUPPLIES $5,182.50 


FOR: LOOSELEAF NOTEBOOKS 
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TOPIC: Formatting Multi-Row Records Using Newlines 


When doing simple formatting of a numeric matrix (e.g. 7 O&NMAT), the 
character matrix result contains one row pér row of the matrix being 
formatted. In the problem above, the result appears to contain one 
matrix (a check) per row Cor element) of the arrays being formatted. 
The term "appears" is used because the "matrix" may in fact be a 
vector with embedded newline (carriage return) characters. Let us 
look at the above example in a special way: 


~.- NO. OO305bbbbb DEC 15, 1986nnn TO: ACME SUPPLIES 


— ee eee oe 


ee $5,182.50nnnFOR: LOOSELEAF NOTEBOOKS Sn Rice nnnnnn'’ 


The n represents a newline character and the b represents a backspace 
character. Notice how the backspaces are being used to underline the 
check number. The entire vector, including newlines, backspaces and 
blanks is 174 characters long. The problem becomes much simpler if 
you consider that your task is to format and combine the arrays into 
a 174 column matrix, with one row per element or row of the arrays. 


Begin by breaking the CDATE variable into two parts: the month (as a 
3 letter abbreviation) and the day/year portion (DDYY): 


MON¢12 30’ JANFEBMARAPRMAYJUNJULAUGSEPOCTNOVDEC' 
MON€MONT | CDATE+100003 ] 
DDYY<10000 1 CDATE 


Let us solve the problem first for those APL implementations which 
have OFMT (e.g. APL*PLUS and SHARP APL). Given the appropriate 
format string CFS) left argument to OFMT, the desired matrix result 
(CHKS) may be constructed as: 


CHKS¢FS OFMT CCNUM;MON; DDYY; VENDOR; CAMT 3; DESC) 


The format string left argument of OFMT may contain special 
characters such as newline and backspace. Construct the variables NL 
and BS to contain the newline and backspace character scalars 
respectively: 


APL* PLUS SHARP APL 
BS€OTCBS BS¢0OAVL158+0I0) 
NL¢OTCNL NL¢OAVL156+01I0] 


The format string is constructed by carefully piecing together the 
control characters needed to produce the special 174 column matrix. 


FS¢'X%36,6N0O~. [32715.-", (C5eBS) 4.056" 5" Sy X35 GAL; ! 
FS¢FS, 'S<9?>G< ??, 197?7?>,<',C3eNL),' TO: >,’ 
FS¢FS,'25A1,P<$>CK 2F15.2,<',(3eNL),'FOR: >,’ 
FS¢FS,'40A1,<’ ,(6eNL),'>' 


To print the checks, Just display the variable CHKS on the printer. 
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Now let us solve the problem using format by example (dyadic 6 with 
character vector left argument, an APL2 enhancement). The approach 
is basically the same. However, # does not allow more than one array 
in its right argument. We must therefore build the result in 
sections and then combine the sections. 


BS¢OTCLOIOI 
NL¢OTCL1+0I0] 
ROWS¢oCNUM 


S1¢€CC36e’ '),'NO. 05555’ ,(50BS),(050’_'),30e’ ')6 
CROWS ,1) 0 CNUM 


S2€MON (MON and DDYY from above) 
S3<«C’ 05,_5555' ,(3eNL),’ TO: ')& CROWS ,1) 019004100001 

0 100TDDYY 
S4¢VENDOR 
S5€(0'$555,555,553.50', €3eNL),’ FOR: ')6 CROWS , 1) o0CAMT+100 
S6¢DESC 


S7¢( ROWS, 6)oNL 


CHKS€S1,52,53,54,55,56,S7 
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PROBLEM: Frequently, when an APL application produces multi-page 
reports, the reports are sent to a "print file" rather than 
printed directly. Why? How should such a print file be 
organized? How should its contents be printed? 


TOPIC: Directing Report Output to Print Files 


There are several reasons for directing report output to a print file 
rather than printing/displaying it immediately: 


1. To avoid the interruption caused by printing when many different 
reports are to be generated. You may generate them all (toa 
print file) and then let them print unattended. 


2. To enable simple and inexpensive restarting. If the paper jams 
or runs out Cor the line to a remote computer drops), you may 
reprint the report without having to regenerate it. 


3. To make multiple copies of a report. The print file may be 
printed repeatedly without regenerating the report. 
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4. To print the report on a remote (batch) high-speed line printer. 
A print file must exist in order for you to submit a batch 
request for remote printing. 


5. To spot-check a lengthy report. If a lengthy report is appended 
to a print file by an applicaton, selected pages may be printed 
and reviewed when deciding whether or not to print the entire 
report. 


The following is a reasonable organization for a print file (assuming 
your APL implementation allows APL files or a reasonable emulation): 


Component Description 


ee ee eel msm mm are eee i i 


1 to 10 Clatent, i.e. empty character matrices: 0 00’') 
9+2xI Character matrix (or character vector with embedded 
newline characters) representation of page I (1,2,3,...) 


of the report. When displayed, the array will require no 
more lines than can be accomodated by the paper on which 

the page is to be printed (typically 66, i.e. 6 lines per 
inch on paper which is 11 inches high). 


10+2xI One column all blank character matrix with as many rows 
aS are required at the bottom of page I to reach the 
bottom of the paper (typically 66 minus the number of rows 
in the matrix stored in component 9+2xI). 


The first 10 (latent) components are included in case your 
implementation has a remote (batch) high-speed line printer 
capability. Typically, these batch facilities require several 
control components at the beginning of your print file. The precise 
Significance of these control components is a function of your APL 
implementation. Include whatever components are needed in your 
environment. 


After the first 10 components, the print file is organized into pairs 
of components, one pair per page. Two components are used per page 
instead of one so that your printfile will not be filled with 80 
column (Cor so) blank rows whose only function is to move the printer 
to the top of the next page. In fact, if file storage is a major 
consideration, you should break each page into many pieces (some 
pieces only one line) so that excess spaces can be omitted. However, 
the file organization would then not allow direct access to a page in 
the middle of the file since you could not determine the component in 
which it begins except by trial and error or by maintaining a 
directory. 


With this file organization, you can immediately tell how many pages 
are on the print file (.5x~10+number of components) and you can 
determine exactly where any page is stored (component 9+2xI for page 
Bi ae 
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To send the contents of a print file to a remote (batch) high-speed 
line printer, you must follow the directions which apply to that 
facility in your environment. However, if you want to print pages on 
your local printer (Cor hardcopy terminal) or just wish to spot-check 
pages on your CRT terminal, the following PRINT function may be 
useful. To use it, you tie (or otherwise activate) the print file 
and provide the tie number (or other file identification) as the 
right argument of PRINT. The dialog will then look something like 
this: 


73 PAGES ON THE PRINTFILE. 


BEGIN ON WHICH PAGE (COR END): 25 
ALIGN PAPER TO PERFORATION AND PRESS RETURN. 


(page 25 prints) 
(page 26 prints) 
(page 73 prints) 
ERASE PAGES? YES 
NO PAGES ON FILE. 


If you respond YES to the ERASE PAGES? question, all pages on file 
will be erased. That is, all but the first 10 components of the 
print file will be dropped. 


In this function, some attention handling code has been included (for 
APL*PLUS or SHARP APL) in case the BREAK key is pressed while the 
pages are printing. In that event, the printing immediately stops 
and you are again asked for the page number on which to begin. 
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[CWSID: PRTFILE] 
V PRINT TIE;A;I3;N;P;0ALX;0PW 
a Prints some or all pages in the printfile tied 
A to TIE. Pages are in components 11, 13, 15,... 
A APL*PLUS attention handling (put OALX in header): 
DALX¢'?L1' 
@ SHARP APL attention handling (put OTRAP in header): 
Aa OTRAP«¢'V 1000 E *L1’ 
a Set print width to avoid APL wrap on long lines: 


C1] 
[2] 
cod 
[4] 
[5] 
[6] 
C7] 


C8] 

[9] 

[10] 
C11] 
C12] 
[13] 
[14] 
[15] 
[16] 
C17] 
C18] 
[19] 
[20] 
[21] 
[C22] 
[23] 
[24] 
[25] 
[26] 
[27] 
[28] 
[29] 
[30] 
C31] 
[32] 
[33] 
[34] 
[35] 
[36] 
[37] 
[38] 
[39] 
[40] 
C41] 
[42] 
C43] 
[44] 
C45] 
[46] 
[47] 
[48] 


OPwWe250 

A Number of pages in the file: 

N¢eCCOFSIZE TIE)€C2)]-11)+2 A APL*PLUS 

A N€ECCOUSIZE TIE)£2]-11)+2 A SHARP APL 
ON¢+CéN),' PAGE’ ,CN=1)1’S ON THE PRINTFILE. ' 
A Branch if 0, 1 or many pages: 

lel 

9CEND,L2,L1)01+NL2] 

A Ask for starting page if more than one: 
islet 

M<+P¢’ BEGIN ON WHICH PAGE (OR END): ' 
A¢CeP) 10 

A Reprompt on empty response: 

>(epAILI 

Aa Exit if END entered (even partially): 
>(AA.=(oA)T'END' )oL4 

A Convert response to numeric and validate: 
I¢OFI A A Available on APL*PLUS and SHARP APL 
>(CCL=epIJAA/TEN) 0 L2 

O<«'**x INVALID PAGE NUMBER. ' 

O¢«'*x*x VALID PAGE NUMBERS ARE 1 THROUGH 
Lie’'axx TYPE ''END'’ TO STOP PRINTING.’ 
9>L1 

Aa Align paper: 

L2:f!P¢’ALIGN TO PERFORATION AND PRESS RETURN. ’ 
Ae(oP)I0 

A Exit if END entered: 
+((15pA)AAA.=(eA)T' END’ JoL4 

A Print page and spacing to next page: 
L3:O¢QFREAD TIE,9+2xI A APL*PLUS 

A L3:O0¢0READ TIE,9+2xI A SHARP APL 

O<+OFREAD TIE,10+2xI A APL*PLUS 

A OCOREAD TIE,10+2xI A SHARP APL 

A Branch if more pages: 

>(N2I¢I+1)eL3 

A Erase pages, if desired: 
L4:0¢P¢'’ ERASE PAGES? ' 

>9C'Y’#F1TCoPJJO)eEND 

OFDROP TIE,OL11-CUFSIZE TIE)£2] A APL*PLUS 
A ODROP TIE,OL11-CUSIZE TIE)C2] A SHARP APL 
O<'NO PAGES ON FILE. ' 


[49] END: 
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PROBLEMS: CSolutions on page 366) 


1. Using the HEADINGS function introduced in this chapter, construct 
the following set of column headings. 


LAST YEAR THIS YEAR GROWTH 
a------ 2-H eH ------~------ IN 
AVG. TOTAL AVG. TOTAL TOTAL 
SALE SALES SALE SALES SALES 


— nm a ee et —— ee eee — a ee ee — ae ee oe —e ee ee 


2. Using the numeric scalar DATE which is today’s date in the form 
MMDDYY and the numeric scalar PNO which is the current page 
number, construct the following set of titles using the TITLES 
function introduced in this chapter. 


PAGE 17 
FINANCIAL SUMMARY 


12/15/86 
WESTERN REGION 


3. Given the following numeric matrix, 


NMAT 
1 2 3 4 
5 6 7 8 
9 10 11 12 


13 14 15 16 
17 18 19 20 
21 22 23 24 


how would you format it to appear as follows? 


CMAT 
1 2 3 4 
5.0 6.0 7.0 8.0 
9 10 11 12 
13.0 14.0 15.0 16.0 
17 18 19 20 
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SYSTEM DEVELOPMENT PROCEDURE 


Because APL is so concise, powerful and unrestricted, almost 
anyone can toss together an application system. While experience and 


discipline are useful to have, they are not essential. This is one 
of the reasons why so many APL programmers do not come from 
traditional data processing backgrounds. (The other reason being 


that prolonged use of COBOL tends to rot the brain.) 


If you can solve a problem in APL in one-fifth the time it takes to 
solve it using FORTRAN, it follows that you can develop 5 unreadable, 
unmaintainable APL systems in the time it takes you to develop one 
unreadable, unmaintainable FORTRAN system. To some computer 
scientists, this improvement in productivity leads to a new 
philosophy of system development: throw-away code. The basic idea 
is to write the system fast to get the job done. Then, when 
requirements change and the system is no longer adequate, throw it 
away and build a new one. 


Those who foster the view that APL is the ideal language for writing 
throw-away code are those who would like to see APL thrown away. 
They also tend to kick their pets. 


In many respects, APL is not unlike any other programming language. 
System development should be planned, documented and implemented 
meticulously. If done properly, the system will be a pleasure to use 
and to maintain. If done improperly, the system will be a living 
hell for all those associated with it. Documentation will be scarce, 
if existent. Code will be mystifying, if readable. The user will be 
suicidal, if not homicidal. 


A well-developed system, on the other hand, is easy to recognize. It 
is easy to use so the user rarely needs to refer to the extensive 
user guide. It is reliable and efficient so the technical support 
person rarely needs to scan through the readable code or the 
extensive technical documentation. The word "enhancement" is used 
more frequently than "maintenance" and neither word causes the system 
developer to tremble with anxiety. 
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In this chapter we describe eight steps which should be part of any 
APL system development procedure: 


Familiarization 
Specification 

File design 

Workspace design 

User documentation 

Flow charting 

Coding, typing, testing 
Delivery, training 


On ON FWD 


Yes, in that order. Compare this list to the procedure for the usual 
throw-away APL system: 


1. Coding, typing, testing 
2. Delivery, training 
3. Familiarization COht So that’s what you wanted!) 


I implore you. Please do not dismiss the system development 
procedure outlined here without trying it once. The procedure will 
increase your productivity, improve the quality of your system and 
make the development process more fun. Try it. 
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TOPIC: Familiarization 


During this phase, you become familiar with the problem, not the 
solution. Emphasis is on the needs of the user, not the tools of the 
programmer. 


If a manual system exists and is to be replaced, now is the time to 
study the manual system. If there is no manual system, you should 
talk with the user and "brain-storm" about an ideal system. Sketch 
sample reports and sample input sheets. 


Where will the data come from? Is it readily available? Will the 
value of the system justify the installation and updating of the 
data? How much will the data requirements of the system grow? Will 
the system need to supply data to other computer systems? How 
frequently will reports be generated? Who will use them and why? 
How often will the report formats change? What is the expected life 
of the system? 


The unasked questions which should permeate your thinking are: Is 


this system feasible? Does the user have a clear picture of what 
such a system will be like and how it will interact with the 
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organization? Is the value of the finished system going to justify 
the time and effort required to develop it? 


The familiarization phase could also be called the feasibility phase. 


The phase is complete when both you and the user have a clear 
qualitative understanding of how the system should operate, and are 
both convinced that the system is a good idea. 


TOPIC: Specification 


At the heart of the word "specification" is the word "specific". 
That is what the specification phase is all about: getting 
specific. Put in writing all the details which define the systen. 


Bear in mind during this phase that any system has three major 
aspects: input, processing, output. 


Input: What data items must be supplied to the system? Are there 
other items which are not needed now but may be needed later? How 
many records of data items will the system contain? How fast will it 
grow? From what different sources will the data come? How 
frequently? What is the exact record layout of any data from 
external media (e.g. computer tape)? What is the exact layout of the 
input sheets used to manually enter data? How clean will the data 
be? What data integrity checks must be performed (e.g. salary must 
be a positive integer less than 50,000)? What inter-data 
restrictions must be imposed (e.g. date-of-hire must be earlier than 
date-of-termination)? 


Processing: What data items must be computed from input data items? 
With what formulas? What regular processing operations are to be 
conducted? How frequently? What steps are involved in these 
operations? How often will these steps change and how dramatically? 


Output: What reports will need to be generated by the system? How 
frequently? What is the exact layout of each report? How is each 
report item derived from the input and computed data items? How 
often will the report formats change and how dramatically? Will 
other areas need access to certain data items? In what format? 


The specification phase is complete when you have a written document 
(the "specification") which so thoroughly and specifically describes 
the system that, ideally, it could be given to any expert programmer 
who could then develop the system without conferring with the user. 
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TOPIC: File Design 


During this phase, you draw pictures of alternative file designs. 
You then consider the pros and cons of each design given the input 
and output requirements of the system. 


How many file accesses are required to add one record? To add 100 
records? To change a few items on one record? On 100 records? To 
delete one record? To delete 100 records? To display the entire 
contents of one record? Of 100 records? To search the entire file 
for records which match a set of logical criteria? To generate each 
of the reports included in the specification? How often will each of 
these operations be performed? Will the cost and response time 
resulting from these file accesses be acceptable? 


The file design phase is complete when the structure of each file, 
including its name, is completely documented in written form. 


Ae A A NG Re NS A NG Pe A NS ARS A ANG Pe BO A Re Ow A MNS A NN Bem ASA 


TOPIC: Workspace Design 


During this phase, you sketch sample terminal sessions. The terminal 
sessions illustrate the actual operation of the system including 
system prompts and typical user responses. 


You should work closely with the user during this phase since the 
user must live with the system interaction being designed now. 


Begin with a general flow chart of the operations to be performed by 
the system. Break the flowchart into functional blocks, each of 
which will be implemented as a single APL function. Then, for each 
function, write down the dialog (sample terminal session) produced by 
the function. 


Does the dialog allow for all required input? Does it provide 
control of every processing step? Can any and all reports be 
requested easily? Can the user gracefully exit the system from 
anywhere without losing any input? Can the user quickly navigate 
among the most commonly used operations of the system? Are the 
prompts meaningful? Does the prompting structure allow for optimal 
use of the files, given their design? 
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The workspace design phase is complete when sample terminal sessions 
have been sketched for all contingencies and the user accepts the 
flow of the system without reservation. The major functions of the 
system are identified, named and documented in general terms. The 
file design is updated if necessary. 
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TOPIC: User Documentation 


User documentation may be written before or after the system is 
implemented. It is better to write it before. The only reason for 
writing the documentation after the system is built is that you may 
be able to talk the user out of any documentation at all. Nota 
noble reason. 


By writing the user documentation before the system is implemented, 
you will find that the documentation is much easier to write (fewer 


constraints) and the system is easier to implement. For example, 
before implementation you can write, "Type STOP at any time to 
terminate the system." After implementation you must write, "Type 


STOP to terminate the system when adding records; type END when 
generating reports; type HALT when closing the accounting period; and 
type O-backspace-U-backspace-T if none of these works." 


If you have never documented a system before implementing it, you may 
be reluctant to try it now. Please! Please try it! It will make 
the overall system development task easier and will result in a 
better system. It’s more fun too. Try it once. What can it hurt? 


The user documentation is an instruction manual which explains to an 
inexperienced user how to use the system. It begins with an 
explanation of the purpose of the system and any background 
information required to understand the system. After the 
introduction, the manual consists mainly of the sample terminal 
sessions along with comments explaining the various options. After 
reading a well-written manual, the user should be able to use every 
facet of the system without help from you. 


As you write the manual, you should update the specifications and 
file design if such change is suggested by the documentation 


process. The user documentation phase is complete when the manual 
has been read by the user and accepted without reservation. 
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TOPIC: Flow Charting 


During this phase, you will diagram the program logic which underlies 
each of the major functions identified during the workspace design 
phase. You will do this with the documented file structure on one 
side of your desk and the user documentation on the other side. 


The flow charting phase can be called the "divide and conquer" 
phase. Divide each major function into the general steps which it 
must perform. Then divide the general steps into more specific 
steps. Continue in this fashion until the steps can be translated 
directly into APL code. 


During this subdivision process, you will identify common steps which 
are required in several locations of the system. If such steps are 
not at the low level in which they may be translated into APL code, 
you may choose to label these steps as subfunctions and write their 
comprising steps but once. As you identify each subfunction, you 
should name it, define its syntax, list the variables and functions 
which are global to its operation and write a brief description of 
what it does. This will become a permanent part of the technical 
documentation. 


The flow charting phase is complete when coding is all that remains. 
You will have written descriptions of all functions, subfunctions and 
global variables. You will have flow charts which diagram every 
logical step during the use of the system. The steps will be 
described at such a precise level of detail that they may be 
translated directly (without much logical reasoning) into APL code. 
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TOPIC: Coding, Typing, Testing 


Without having written a symbol of APL code or pulled your chair up 
to your APL terminal, you are more than half done with the system. 

The user has been getting constant feedback from you, has a user's 
manual on his or her desk and has complete confidence in your ability 
to deliver the exact system needed. All this without a symbol of APL. 


If you do not know APL, now is the time to learn it. Quickly. 


During the coding, typing, testing phase, you do just that. The 
three tasks are clumped together because they need not each be 
performed to completion before starting the next task. For example, 
you may want to code 5 or 10 functions, type them, test them and 
repeat this process for the next 5 or 10 functions. 
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While it is possible to code the entire system before typing a single 
keystroke, there are disadvantages to such extreme behavior. For one 
thing, a coding flaw will not be picked up until you begin testing. 
You may have made the same mistake dozens of times throughout the 
system. Second, when you are testing the code, you may not remember 
your intentions in a difficult piece of code. Finally, if doing 
nothing but writing code for 3 weeks does not drive you crazy, then 
doing nothing but typing APL code for 3 days will. And if that does 
not, then 2 weeks of testing will. 


At the other extreme, you may choose to code, type and test one 
function at a time. There are disadvantages to this mode of 
programming. Some design flaws will not be uncovered until you get 
further into the system. Such flaws may require you to rewrite or 
scrap functions written earlier. Any time spent typing or testing 
now obsolete code will have been wasted. 


The coding, typing, testing phase is complete when everything is 
tested and you are ready to turn the system over to the user. 
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TOPIC: Delivery, Training 


During this phase, you will transfer the system from your control to 
the user’s control. You will initialize any files which have not yet 
been initialized and will move the workspace or workspaces to the 
user’s library if they need to be moved. 


When the system is ready to roll, you will meet with the user to take 
a spin. Having read the user documentation (Cand helped you design 
the dialog), the user should require little guidance or training from 
you. As the system is tested, you will need to make two lists. The 
first list refers to bugs which are encountered. If your testing 
process was careful and thorough, this list will be empty. 


The second list refers to suggested enhancements to the system. 
There is nothing like a live system to suggest what is wrong with 
it. The user will be happy to mention these. Since you worked 
together closely to design and document the system, you will not be 
blamed for delivering an imperfect system. Rather, you will be 
commended for delivering what you agreed to deliver. 


The delivery and training phase is complete when the user accepts the 


system as is. If there are enhancements to be made to the systen, 
you should implement them as you did the system: familiarization, 
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specification, file design, etc. For simple enhancements, you may 
get through all the phases in a few minutes. Do not forget to update 
the technical and user documentation. 
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PROGRAMMING STANDARDS 


The reason for programming standards is to create a conformist 
world in which every programmer thinks and programs the same way. 
What these programmer clones lose in creativity they more than make 
up in productivity. After all, when picking up a program written by 
Clone A, Clone B has no trouble reading it. Not only is the language 
familiar; so too is the dialect and the handwriting. 


The purpose of this chapter is to present a set of APL programming 
standards. They are not presented as the perfect set nor even as the 
author's preferred set. Rather, they are a set. Pick and choose as 
they suit you. The important thing here is that the set you choose 
be accepted by all those in your organization who will work on the 
same systems as you. 


The standards are organized by the phases of the system development 
procedures presented in the prior chapter. Along with the dogma of 
each standard is a brief justification for it. If the justification 
is omitted, you may assume it is, "To improve readability by use of 
consistent conventions." 
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TOPIC: Familiarization 


1-1 Select, or have appointed, another programmer to review your 
work. 


WHY: To spot design and logic flaws and otherwise help you see the 


forest from the trees. This is also one of the best ways to 
learn new design and programming techniques. 
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1-2 Make certain a single user has been assigned the responsibility 
of working with you. 


WHY: You need a single person to accept your design and to be held 
responsible for it. 
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TOPIC: Specification 


2~1 Include hand drawn input sheets and full-screen input forms. 
WHY: To insure that you and the user and you see input 
requirements eye to eye. 
2-2 Include hand drawn reports containing exact headings, line names 
and number formats. 
WHY: To insure that you and the user and you see output 


requirements eye to eye. 


2-3 Have someone else review the specification. 
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TOPIC: File Design 


3-1 All file components are assigned a meaningful variable name 
(first letter underscored) which is used when the component is read 
into the workspace. 

WHY: To help anyone reading the code to identify objects from file. 


3-2 The file directory, if there is one, is stored in component 1 of 
the file. 


3-3 On-line file documentation, if any, is stored in component 2 of 
the file. 
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3-4 Leave at least 10 latent (empty vector) components at the start 
of the file for future design modifications. 


3-5 Files are documented on paper (preferably using word processing 
software) and include the component number, variable name, shape and 
description of each object in the file. 


3-6 Have someone else review the file design. 
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TOPIC: Workspace Design 
4-1 Along with each user prompt, list all possible error messages in 
the sample terminal sessions. 


WHY: To insure consistent error messages. 


4-2 Use the following standard user keywords when needed: 
ADD: Add more data to database. 
CHANGE: Replace an existing value with another. 
DELETE: Remove data from database. 
SHOW: Display data from database. 


INSERT: Add more data among existing data in a database where 
order of data is important. 


END: Normal termination of the current phase of the program. 
HALT: Terminate program abruptly and compeltely. 


WHY: To be consistent so the effect of various responses to a 
prompt can be anticipated. 
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4-3 The system is invoked by loading an autostarted workspace (using 
OLX). The functions do not return to immediate execution mode until 
the system is terminated. All input is accepted via character input 
(fT) mode or full-screen input mode rather than evaluated input (0D) 
mode. 


WHY: To eliminate the possibility of accidentally invoking 
non-user functions or of getting APL system error messages 
(e.g. SYNTAX ERROR). 


4-4 Applications are terminal independent unless special features 
are expressly desired. 


WHY: To allow switching from one terminal to another without 
requiring program modifications. 


4-5 All reports are directed to a printfile rather than to the 
terminal. The contents of the printfile must be displayed ina 
separate step. 


WHY: To allow easy, inexpensive report restarting in the event of 
line noise, line drop, printer malfunction or complete 
crash; to allow reference by page number when printing; to 


allow flexibility in directing reports to terminal, line 
printer or remote printer. 


4-6 Have someone else review the workspace design. 


TOPIC: User Documentation 
5-1 Include all possible prompts in the documentation, along with 
descriptive text. 

WHY: To ease the transition from text to terminal. 
5-2 Write the documentation using whatever word processing software 
is commonly used in your department or company. 

WHY: To insure professional appearance of documentation; to allow 

quick, easy modifications; to ease the transfer of system 


support since the same word processing software is used by 
all. 
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5-3 If the dialog or options of the system are necessarily 
complicated, include a brief summary of the workspaces, functions, 
keywords, choices, and so on for quick reference. 


WHY: So the user does not have to thumb through the lengthy 
documentation. 


5-4 Include a complete Table of Contents. 


5-5 Have someone else review the documentation. 
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TOPIC: Flowcharting 


6-1 The purpose of each function is documented in a one-sentence 
description, including the function syntax and its dependence upon 
global variables and subfunctions. All functions have meaningful 
names or abbreviations. 


6-2 The purpose of each global variable is documented in a 
one-sentence description. 


6-3 Functions with same names in different workspaces are identical. 


6-4 Subfunctions are chosen judiciously. They have well defined 
arguments, produce well defined results or effects and require or 
create a minimum number of global variables or other subfunctions. 
The state indicator does not get deeper than 5 levels. 


WHY: All user defined functions call other functions (at a 
minimum, APL primitive functions). The key to readability is 
that the subfunctions can be understood without reference to 
further documentation. If there are too many subfunctions 
and they are not neatly defined, the reader will spend too 
much time flipping back and forth among function listings 
instead of reading code. The state indicator in the human 
brain can generally go no deeper than 5 levels without losing 
track. 
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6-5 Flowchart using words and diagrams, not APL code. 

WHY: To compel the programmer to organize her thoughts and plans 
before getting bogged down in coding details. If coding is 
all that remains, flowcharting is done. 

6-6 Include all error checks in flowchart. 
WHY: To remember them when coding and to not underestimate the 


complexity of the function. 


6-7 Have someone else review the flowcharts. 
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TOPIC: Coding, Typing, Testing 


7-1 All workspaces contain the global variable <wsid> which contains 
the workspace identification (WSID) of the saved workspace. 


7-2 All workspaces contain the global variables <fnums> and <fnames> 
which contain the file numbers and file names of the files which are 
assumed to always be tied. 


7-3 File tie numbers are assigned to global variables whose 
meaningful names are prefixed with 'f' (e.g. £fSMRY, fEMPL). These 
variables reference files which may or may not be tied (see 
<fnums>). The tie number of the printfile is assigned to fPRINT. 


7-4 The name of the printfile is 'PRINTFILE’. 


7-5 All workspaces contain the function TIEFILES which ties those 
files in <fnums> and <fnames>. 


7-6 Workspaces which alter their OLX contain the global <lx> which 
1s assigned the original value of OLX. 


7-7 Variables global to the workspace have meaningful names and are 
completely underscored Cexcept for global variables containing file 
tie numbers). 
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7-8 Variables from file and localized variables used globally by 
other functions have meaningful names and have their first character 
underscored. 


7-9 Functions on file, like variables on file, have meaningful names 
with the first character underscored. 


7-10 Strictly local variables (localized and not used within 
subfunctions) contain no underscores in their names. Meaningful 
variables have meaningful names or abbreviations. Temporary, 
intermediate or useless (e.g. from OEX) results are assigned to 
Single letter variable names and are not used further than 5 
statements beyond the assignment. 


7-11 The meaning of a variable is commented when first assigned 
unless the comment is the name, or the variable is read from a 
documented file, or the meaning can be inferred from a prompt. 


7-12 The first line or two of every function is a comment which 
explains the purpose and syntax of the function, and lists the global 
variables and subfunctions required by the function. 


7-13 When calling a subfunction, include a comment which lists the 
global variables and subfunctions required by the subfunction. 


7-14 The intent of every function line is commented. 


WHY: To help the program maintainer quickly locate and decipher 
code which needs fixing or enhancing. Writing code is the 
process of converting the intent to the code. Reading code 
is the process of attempting to reconstruct the original 
intent based on the code. Since the intent is obvious during 
coding, it can be included ina fraction of the time (and 
mental effort) it would take to reconstruct it later. Lines 
that contain prompts or error messages are often 
self-commenting. Examples of comments which unsuccessfully 
and successfully comment the intent: 


A Squeeze out the flagged rows of the matrix. (no good) 
A Ignore inactive profit centers. (good) 
A Set OELX to capture error messages. Cno good) 
Aa Prepare for file reservation errors. (good) 
A Increment and repeat if not done. (no good) 
A Loop by region. (good) 
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7-15 Line labels are Ll, L2, L3,... and are kept in ascending order 
even if not sequential. Significant labels, which segment the 


function or identify key steps, may have meaningful names. For 
example: 


*(C'ACDE’' =1TR)/ADD, CHANGE, DELETE,END 


7-16 Branching is always to a line label or empty vector (never 70 
OF 3) % 


WHY: Without »0, the program must exit through its "bottom" which 
is better style than having many exit points. For example, 
you may be certain that a statement added to the end of a 
function will always be executed. The use of naked branch 


(>} removes control from cover functions which may call the 
function. 


7-17 Recommended branching techniques are: 


»>LABEL 

»CONDITION/ LABEL 
»>CONDITIONS / LABELS 
»CONDITION! LABEL 
»>LABELSC INDEX] 
»>CONDITION®OLFALSE, LTRUE 


7-18 Recommended looping technique is: 


Tel 
LOOP: »ENDLOOP IF I>LIM 
process I 

T¢eL{[+1 

>LOOP 
ENDLOOP: 


7-19 Use ~2147483647 for numeric values which are "not applicable"; 
assign this constant to the variable <huge> if used frequently. 
7-20 All output to the terminal is through O or @. For example: 


O<«’'ENTER YOUR CHOICE’ 


WHY: To help locate all terminal output if the need arises (e.g. 


to direct output to a file) and to improve readability (e.g. 
O<PROCESS MAT vs. PROCESS MAT). 


=e 


Chapter 12 PROGRAMMING STANDARDS 


7-21 Evaluated input mode (0) is not used. 


WHY: To avoid unintentional escapes via )LOAD or )OFF, to avoid 
unintentional execution of defined functions, and to avoid 
technical error messages (e.g. SYNTAX ERROR) to the user. 


7-22 Maintain OITO=1 globally. Localize DIO if assigned as 0. 


7-23 The local result variable is used only for the result and not 
for temporary values. 


WHY: To avoid unintended results upon premature function 
termination and to avoid confusion when reading the function. 


7-24 The local argument variables are never reassigned except to 
ravel them. 


7-25 Every function line is restartable. That is, no other 
functions should be performed on the same function line once a 
function has been executed whose effect should not be repeated. For 
example, T€¢2+V¢V,R is unrestartable. 


WHY: So that any function line may be restarted from its beginning 
after it has been stopped, say by an error. For example, the 
following suspension cannot be properly restarted via 3 
since V will have been extended twice: 


WS FULL 
MODEL[3] T¢2+V¢V,R 
A 


7-26 Error messages or other error handling logic immediately follow 
detection. For example: 


[10] >(X>0)/1L2 

[11] O<e'*x* VALUE MUST BE POSITIVE’ 
[12] >L1 

[13] L2: etc. 


WHY: To avoid having to search through the function to find the 
code which handles each error condition. 


7-27 Lines containing multiple statements perform a single, logical 
operation. 
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7-28 All error messages are passed through an error-displaying 
function named ERRMSG. 


WHY: To allow consistent presentation of error messages (e.g. 
preceding them by two stars or beeping twice or displaying in 


a specified position on the screen). To enable you to find 
all error messages for inclusion as an appendix in the user 
manual. 


7-29 Testing is performed methodically by stopping on every function 
line, not experimentally (i.e. by jumping from bug to bug). 


7-30 Test all edge (e.g. empty vector) conditions. 


7-31 Have someone else review the code. 
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TOPIC: Delivery, Training 


8-1 The contents of all workspaces are documented using whatever 
workspace documentation software is commonly used in your department 
or company. This software produces a printed, paged listing of the 
definitions of all functions and at least the names and shapes of all 
global variables. 


8-2 Functions are not locked unless you have a significent reason 
for doing so. 
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WORKSPACE DESIGN AND DOCUMENTATION 


For a given computer application, two different programmers will 
design and implement it differently. In fact, a single programmer 
will develop the application differently at different times in her 
own career. Because of the flexibility of APL, a spectrum of 
approaches are both possible and feasible for any problem. How then 
1s one to choose between plausible approaches when designing an 
application system? In this chapter, we discuss workspace design and 
documentation considerations. The aims are to expand your 
appreciation of the trade-offs involved during the design process, 
and to help you document an existing application. 
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PROBLEM: Develop an application which will maintain a list of 
employees. For each employee, maintain the employee’s 
number (4 digits), name (last name first) and age. Do not 
use files for this application. Rather, store the 
information in the global variables ENUM Cinteger vector), 
ENAME (25 column character matrix) and EAGE Cinteger 
vector). Provide capabilities for adding, deleting and 
listing employees. 


TOPIC: Subfunction Design 


As simple as this application is, no two programmers will develop it 
exactly the same way. The most pronounced difference between 
solutions is the degree to which subfunctions are employed. At one 
extreme, a single function is written which calls no subfunctions. 

At the other extreme, a primary (or main or cover or driver) function 
is written which calls a variety of subfunctions which in turn call 
subfunctions and so on as desired. 


saat OT 2S va 


Chapter 13 WORKSPACE DESIGN AND DOCUMENTATION 


The following is an illustration of the APL code written to implement 
the application as a single function. 


[WSID: FLF] 
V EMPLOYEES sAGE;G;GOOD;NAME;NUM;P;R 


i | 
[2] 
[3] 
[4] 
[5] 
C6] 
C7] 
C8] 
[9] 


a Ask for choice on same line: 

CHOOSE: fM¢P¢«'ADD, DELETE, LIST OR END: ' 
Re CoP)lO 

A Branch based on lst char of response: 
>(C'ADLE’=1TR)/ADD, DELETE, LIST, END 
O<e'*x* INVALID CHOICE. CHOOSE FROM: ADLE’ 
»CHOOSE 

A 

A 


[10] ADD: 0¢'EMPLOYEE NUMBER (COR O IF DONE) ' 

Ci1lJ NUMe,O 

[12] A Continue if exactly 1 number entered: 
[13] 23C1=oNUM)/Al1 

{[14] QO¢'*«* ENTER 1 NUMBER’ 

[15] ADD 

[16] A Branch to choice question if 0 entered: 
[17] Al:>?CO=NUM) /CHOOSE 

[18] A Continue unless employee number already exists: 
[19] 7» €NUMEENUM)JA2 

[20] O«'’** EMPLOYEE ’,CéNUM),' ALREADY IN LIST’ 
C21] ADD 

C22] A2:0¢P¢'’ EMPLOYEE NAME (MAX 25 CHARACTERS): ' 
[23] A Ask for name at end of same line: 

[24] NAME¢(oP)10 

[25] a Continue unless name too long: 

[26] %(2520NAME)/A3 

C27] O¢'** NAME TOO LONG’ 

[28] ~A2 

C29] A3:0¢’EMPLOYEE AGE’ 

[30] AGE¢,DO 

[31] A Continue if exactly 1 number entered: 
[32] %(1=pAGE)/A4 

[33] O¢’'** ENTER 1 NUMBER’ 

[34] A3 

[35] A Continue if a valid age: 

[36] A4:3( CAGE=[AGE)/ACAGE217) AAGE*S99)/A5 

[37] O¢’** AGE MUST BE INTEGER FROM 17 TO 99' 
C38] %A3 

[39] A Catenate new values and ask for more: 
C40] A5: ENUM¢CENUM,NUM 

[41] A Pad name to length 25: 

C42] ENAME¢CENAME,C(11]25TNAME 

[43] EAGE¢EAGE , AGE 

[44] ADD 

C45] A 

[46] A 

[47] DELETE: 0¢’ ENTER EMPLOYEE NUMBERS TO DELETE’ 
[48] A Ravel to insure a vector, not scalar: 
[49] NUM¢,O 


~ 7 6> 


Chapter 13 WORKSPACE DESIGN AND DOCUMENTATION 


VY EMPLOYEES (continued) 
[50] aA Continue if all valid numbers: 
C51] »*C€CA/GOOD¢NUMEENUM) /D1 
[52] Oe’sxs*e NOT FOUND: ',6(~GOOD)/NUM 
[53] 7*DELETE 
[54] A Flag those employees to keep: 
[55] D1: GOOD¢~ENUMENUM 
[56] Aa Squeeze out deleted employees: 
[571 ENUM¢GOOD/ENUM 
C58] ENAME¢GOOD/ENAME 
[59] EAGE¢GOOD/EAGE 
[60] 7*CHOOSE 
[61] a 
[62] A 
[63] LIST:0¢'NUMBER AGE NAME’ 
[64] Oe'’ 
[65] A Prepare to sort employees by number: 
[66] Ge¢sENUM 
[67] A Sort and display: 
[68] O¢(5 0 7 O SENUM[(G],[(1.5]EAGE[G]),CCCoOENUM) ,3)0e’ '), 
ENAME[G; ] 
[69] Oe¢’’ 
[70] »CHOOSE 
C71] A 
[72] A 
[73] END: 
V 


The following is an illustration of the APL code written to implement 
the application in highly subfunctionized fashion. EMPLOYEES is the 
Qriver function. 


CWSID: MSF] 

V ADDEMP;AGE ;NAME;NUM 
C1] Al:NUM¢NINPUT 'EMPLOYEE NUMBER (OR O IF DONE)’ 
[2] a Exit if 0 entered: 
C3] 90 IF O=NUM 
[4] aA Continue unless employee number already exists: 
C5] ~A2 UNLESS NUMEENUM 
C6] [ie '’*x* EMPLOYEE ',(6NUM),’ ALREADY IN LIST’ 
[7] 9Al 
[8] A2:NAME¢CINPUT '/EMPLOYEE NAME (MAX 25 CHARACTERS): ' 
C9] aA Continue unless name too long: 
C10) %A2 IFC25<oNAME)MESSAGE '** NAME TOO LONG’ 
C11] A3:AGE¢NINPUT '/EMPLOYEE AGE’ 
C12] ~»%A3 IFC CAGE#!I AGE) VY CAGE<17)VAGE>99)MESSAGE '** AGE MUST 

BE INTEGER FROM 17 TO 99' 

C13] A Catenate new values and ask for more: 
[14] CATEMP 
[15] Al 

V 
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[WSID: MSF] 
V CATEMP 
C1] ENUM¢ENUM,NUM 
[2] aA Pad name to length 25: 
E34 ENAME¢ENAME RCAT NAME 
[4] EAGECEAGE , AGE 
V 


[WSID: MSF] 
VY R¢ECINPUT PROMPT 
[1] aA Display prompt and ask for response on same line: 
[2] M%<¢PROMPT 
C3] R¢CoPROMPT) JO 
Vv 


CWSID: MSF] 
V DELEMP;GOOD;NUM 
C1] L1:0¢’ENTER EMPLOYEE NUMBERS TO DELETE’ 
[2] aA Ravel to insure a vector, not scalar: 
C3] NUMe ,O 
C4] aA Continue if all valid numbers: 
[5] 9L2 ILFA/GOOD¢NUMEENUM 
C6] Ne'*x*x NOT FOUND: ’',8(0~GOOD)/NUM 
C7] 9L1 
[8] a Flag those employees to keep: 
[9] L2 : GOOD¢~ENUMENUM 
[10] A Squeeze out deleted employees: 
[11] SQZEMP GOOD 
V 


CWSID: MSF] 
V EMPLOYEES;R 
C1] aA Ask for choice: 
[2] CHOOSE:R¢’ADLE’ SELECT 'ADD, DELETE, LIST OR END: ' 
[3] A Branch based on response: 
[4] »* (ADD, DELETE, LIST,END) ER] 
[5] A 
[6] ADD:ADDEMP 
C7] »CHOOSE 
C8] aA 
[9] DELETE: DELEMP 
[10] ~*»CHOOSE 
C11] aA 
C12] LIST: LISTEMP 
[13] »CHOOSE 
[14] a 
[15] END: 
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C1] 
C2] 
C3] 
C4] 
C5] 
C6] 


[7] 


C1] 
C2] 
C3] 


Ci] 
C2] 
C3] 
[4] 
[5] 
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C5] 
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[WSID: MSF] 
V RELINE IF CONDITION 
R¢CONDITION/ LINE 
Vv 


CWSID: MSF] 

V LISTEMP;G 

O«’'NUMBER AGE NAME ' 

fj’ ' 

A Prepare to sort employees by number: 

G¢eAENUM 

a Sort and display: 

D<+(€5 0 7 O SENUM[G],[1.5]EAGE(G]),(C€CCoENUM) ,3)0' '), 

ENAME[G; ] 

fe’! 
V 


[WSID: MSF] 
V R€¢CONDITION MESSAGE CVEC 
R€ECONDITION 
>0 UNLESS CONDITION 
O<CVEC 
V 


CWSID: MSF] 
V R¢ENINPUT PROMPT 
L1:O0¢PROMPT 
A Ravel response to insure a vector, not scalar: 
Re ,O 
A Exit if exactly 1 number entered: 
9L1 IFC1#eRJMESSAGE '** ENTER 1 NUMBER’ 
V 


[WSID: MSF] 
Vv ReM RCAT V 

RM, (11(1loM)tV 
V 


CWSID: MSF] 
V IND¢CHOICES SELECT PROMPT;R 
A Ask for choice: 
ASK: R€CINPUT PROMPT 
a Search vector of choices for ist char of response: 
IND¢CHOICESLi1TR 
Aa Ask again if not a valid choice: 
29ASK IFCIND>epCHOICES )MESSAGE '* INVALID CHOICE. 
CHOOSE FROM: ',CHOICES 
V 
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[WSID: MSF] 
V SQZEMP BIT 
C1] ENUM¢BIT/ENUM 
C2] ENAME©BIT/ENAME 
C3] EAGE¢BIT/EAGE 


C[WSID: MSF] 
V R€LINE UNLESS CONDITION 
Ci] R¢CONDITIONJ LINE 


By any measure, this sample application is tiny. Yet the advantages 
and disadvantages of these two extreme approaches emerge even in an 
application of this size. As the application grows, the differences 
become more important, even critical. For easy reference, we will 
use the abbreviations FLF (few large functions) and MSF (many small 
functions) to refer to the two extreme approaches illustrated above. 
Let us discuss the pros and cons of each. These considerations 
should be kept in mind when developing an application system so that 
the cons are minimized. 


1. Utility Functions 


A utility function is a usually small (under 20 statements) 
subfunction which performs a common task and which usually gets its 
inputs entirely from its arguments (vs. from global variables) and 
returns its outputs as an explicit result. A well designed utility 
function will resemble a primitive APL function in its behavior. 
Some examples of the application of utility functions in the 
illustrations above include: 


~A2 UNLESS NUMéEENUM 
9A2 IF (€25<opNAME) MESSAGE '** NAME TOO LONG’ 
NAME¢CINPUT ’'EMPLOYEE NAME (MAX 25 CHARACTERS): ' 


The FLF approach avoids the use of utility functions while the MSF 
approach uses them generously. This is a pro for MSF and a con for 
FLF. Utility functions provide two distinct advantages, both of 
which improve programmer productivity. The first is that a utility 
function can replace several lines of common code, allowing you to 
write and test code faster. For example, compare the following: 
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AGE¢NINPUT 'EMPLOYEE AGE’ 
Vs. 


L1:0¢’EMPLOYEE AGE’ 
AGE¢€ , 0 

>*(1=o9AGE)/L2 

O+«'*x* ENTER 1 NUMBER’ 
»L1 

L2: 


The second advantage is that utility functions can improve code 
clarity, allowing you to read code faster. For example, compare the 
following: 


9A2 UNLESS NUME€ENUM 
vs. 


> (NUMé€ ENUM) JA2 
or +(~+~NUMEENUM) /A2 


2. Global Changes 


After coding and testing an application, you may be asked by the user 
to make a change which is pervaSive. For example, "Remove the dollar 
signs from all the numbers being displayed," or 'Precede all error 
messages by a 10 space indent and 3 stars," or "When prompting for 
numbers, await the response on the same line as the prompt." If you 
have taken the MSF approach and have used your subfunctions 
consistently, you may be lucky enough to only have to change a single 
function. For example, if all error messages are being displayed 
within a subfunction MESSAGE, you may be able to implement the second 
request above by changing the line of MESSAGE, 


O¢+CVEC 
to 
{le ' kkk’! ,CVEC 


This is a pro for MSF and a con for FLF. However, if you have not 
used your subfunctions consistently, much of the advantage will be 
lost. For example, if some error messages are being displayed 
directly and not within the subfunction MESSAGE, you will be forced 
to conduct an extensive search for those messages. This -is exactly 
what you need to do if you used the FLF approach. Since FLF involves 
fewer functions than MSG, a slight advantage goes to FLF. 
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3. Function Size 


There exists a school of thought in the community of APL programmers 
that the ideal size of a function is a page. No line should be wider 
than a page (i.e. 80 characters or so) and no function should have 
more lines than will fit on a page (i.e. 50 lines or so). The 
rationale for this conviction is that the eye and the brain can 
retain no more than a page or so at a time. Further, by confining 
each function to a page, the programmer is forced to discern the 
forest from the trees. The outline of the program logic is placed in 
the higher level function and the detailed logic is included in 
subfunctions. The FLF approach violates this standard without 
remorse. The MSF approach adheres to it rigorously. 


Score one for MSF if you want to see the forest and not the trees. 
Score one for FLF if you want to see the forest and the trees. You 
can find the trees with the MSF approach but not without leaving the 
forest (i.e. flipping to another page). 


At any rate, smaller function size is a definite pro for MSF when it 
comes to function editing. If you use a full-screen editor, a small 
function will fit nicely on a single screen. If you use a 
line-oriented editor, a large function will suffer more from line 
renumbering (due to line additions or deletions) than will a small 
function. For example, if line 15 of a 300 line function is deleted, 
285 lines will be renumbered and may need to be reprinted. If line 
15 of a 30 line function is deleted, only 15 lines will be renumbered. 


4. Self-Containment 


The issue of self-containment becomes most obvious when you load the 
workspace and explore its contents. For FLF: 


JFNS 
EMPLOYEES 
For MSF: 
JFNS 
ADDEMP CATEMP CINPUT DELEMP EMPLOYEES IF 
LISTEMP MESSAGE NINPUT RCAT SELECT 


SQZEMP UNLESS 


If you decided to merge this application with another, it will be 
easier to accomplish with the FLF approach. Since the entire 
application is contained within a single function, the application 
can be moved about (copying or erasing) as easily as moving the 
function. With the MSF approach however, you need to be careful that 
name conflicts (i.e. functions with the same name in two different 
applications) do not exist. 
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For example, if you copy the above MSF functions into another 
application workspace in which a different SELECT function is 
defined, one of the two SELECT functions will be erased (depending on 
whether COPY or PCOPY is used). Likewise, if you then erase the 
above MSF functions from the merged application workspace, the 
remaining application may no longer work without the IF or CINPUT 
functions. 


Finally, any good written technical documentation will include at 
least a brief description of each function in an application. The 
task of writing that documentation is considerably simpler for FLF 
than for MSF since there are fewer functions to document. 


5. Global Passing 


Another school of thought in the community of APL programmers states 
that global variables should be avoided as much as possible. Data 
should be passed to functions as arguments. There are three reasons 
for this view: 


A. Less confusion. When you are reading a function and you encounter 
an undocumented global variable, your reading flow is interrupted. 
What is this variable? Where did it come from? Did I just overlook 
its assignment? Did the programmer just forget to localize it in the 
header? Is it a niladic function and not a variable at all? Is it 
global to this function and assigned outside of it or is it local to 
this function and assigned within a subfunction to which it is global? 


B. Less documentation. To alleviate some of the confusion associated 
with the use of global variables, you should document a global 
variable at two points: in the first few lines of any subfunction 
which requires the global variable; and at the point where any 
subfunction is called which requires the global variable. For 
example, the ADDEMP and CATEMP functions defined above should include 
the following comments: 


VY ADDEMP 


[Co] A Catenate new values and ask for more: 
[o] aA CRequires globals: NUM,NAME,AGE) 

[eo] aA (Modifies globals: ENUM, ENAME, EAGE) 
Lo] CATEMP 


VY CATEMP 
C1] A Called by ADDEMP. 
[2] aA Requires globals: NUM,NAME,AGE 
[3] aA Modifies globals: ENUM, ENAME, EAGE 
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Cc. Fewer localization problems. When a system makes extensive use of 
global variables, it is easier to forget to localize a variable at 
the proper level or to localize it at the wrong level. Poorly 


localized variables can cause some of the most mystifying errors. 


Since the FLF approach has fewer functions then the MSF approach, it 
has fewer subfunction calls and less global variable passing. This 
is a pro for FLF and a con for MSF. 


On the other hand, the MSF approach is easier to employ when you need 
to use a new name ina function. It is easy to scan a small function 
to see whether a meaningful name has already been used. With FLF, 
you may inadvertently re-use the name of a variable still containing 
valuable information. 


6. State Indicator Depth 


When you pick up a system written by someone else (or written by you 
a long time ago), you will most likely start by reading the cover 
function and then each subfunction as it is called. In order to 
retain the meaningfulness of what is going on, you must maintain a 
mental state indicator as you delve into subfunctions. As you finish 
reading each subfunction, you must remember which function called it 
and in what context so that you can flip back to that function and 
continue reading. This process is analogous to the procedure 
followed by the computer as it is executing the code. 


Unfortunately, the human brain cannot maintain its state indicator as 
flawlessly as the computer. At a depth of 5 or 6, our memories get 
flaky. If the current state indicator is not kept on paper, you may 
get lost and have to start again. 


In the MSF approach, the state indicator depth grows very rapidly. 
Even in the tiny application above, it occasionally gets 5 levels 
deep. For example: 


UNLESS[E1] * 
MESSAGETL 2] 
NINPUT(5] 
ADDEMPT[ 1] 
EMPLOYEES 6] 


In an MSF system of any respectable size, the state indicator will 
occasionally get 10 to 15 levels deep and will average 5 to 9 
levels. Such a system is extremely difficult to read until you 
become familiar enough with the subfunctions that you know what they 
do without looking into them (like primitive APL functions). 


The problem of deep state indicators is especially apparent when you 


are called upon to handle an error in an unknown MSF system. The 
first natural step after generating the error is to check the state 
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indicator. If the state indicator is 9 levels deep, the next natural 
step is to go to lunch. 


Having explored some pros and cons of the FLF and MSF approaches, 
which should you use? Typically, neither. The most readable and 
maintainable system is one which employs a moderate number of 
medium-sized functions, each performing a well-defined task. Utility 
functions which are self-contained and well-documented should be used 
generously. 
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PROBLEM: How is the above application invoked? 


TOPIC: Starting an Application 


In deciding how to start an application, you must first decide 
whether or not immediate execution mode will be needed. If so, the 
user should load the application workspace and then execute (from 
immediate execution mode) whatever functions are user functions (e.g. 
EMPLOYEES). 


If immediate execution mode is not needed, which is most often the 
case, you should stay out of that mode until the termination of the 
application. The main reasons to avoid immediate execution mode are 
simplicity and security. The application will be simpler to use if 
the user is prompted for choices rather the having to remember the 
names of functions. The application will be more secure if non-user 
functions cannot accidentally or intentionally be invoked. 


To "autostart" an application, you should assign the system variable 
OLX (latent expression) to be the name of the desired cover function 
before saving the application workspace. For example: 


OLX<+’ EMPLOYEES’ 
JSAVE EMPLOYEES 


To initiate the application, the user only needs to load the 
application workspace. In some installations of APL, even the 
loading step is unnecessary. A workspace may be specified to be 
automatically loaded when APL is invoked. In either case, the 
expression assigned to OLX will be automatically executed. For 
example: 


~185> 
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JLOAD EMPLOYEES 
ADD, DELETE, LIST OR END: 


In this simple application, no special steps need to be taken after 
the workspace is loaded and before the cover function 1s executed. 

In larger applications, this is less likely to be true. Files may 
need to be tied or shared variables activated and global variables 
may need to be assigned. In such instances, the value of OLX is more 
likely to be 'START’ or '>*RESTART’. 


A typical START function will have the following layout: 


[WSID: MSF] 


VY START 
C1] aA Workspace driver function. Used as: 
[2] A 
[3] A OLX<’ START’ 
C4] A 
[5] aA Display any messages. For example: 
C6] Oe’! 
C7] O<’WELCOME TO THE EMPLOYEE MAINTENANCE SYSTEM’ 
C8] He’ ’ 


[9] aA Assign any global variables. For example: 

C10] OPWe150 

[11] fEMP¢345 

[12] a Tie any files or share any variables. For example: 
[13] "EMPDATA’ OFTIE fEMP 

[14] A Read any global variables from file. For example: 
[15] ENUM¢OFREAD fEMP,1 

[16] ENAME¢OFREAD fEMP,2 

C17] EAGE¢COFREAD fEMP,3 

[18] a Call the cover function. For example: 

[19] EMPLOYEES 

[20] A Do any followup work. For example: 

C21} ENUM OFREPLACE fEMP,1 

[22] ENAME OFREPLACE fEMP,2 

C23] EAGE OFREPLACE fEMP,3 

C24) OQOFUNTIE fEMP 


C25] Oe!’ 
C26] O¢’HAVE A NICE DAY’ 
C27] O¢'' 

V 


A RESTART function Cused as OLX¢’?RESTART’) performs the same tasks 
as the START function but handles "line drops" in those APL 
environments which support CONTINUE workspaces. For example, suppose 
you are connected via terminal, modem and telephone line to a remote 
APL system. If the telephone line is interrupted (say due to 
telephone line noise or by accidentally unplugging your modem or 
terminal), the APL system will detect the drop and will save your 
active workspace into a stored workspace named CONTINUE. When you 
next sign on, the CONTINUE workspace will be automatically loaded and 
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installations of APL, you may need to 
manually.) 


its OLX executed. (In some 
load the CONTINUE workspace 


In such an instance, you do not want to run the START function 
again. Rather, you want to redo any steps undone by the drop (e.g. 
retie the files or reshare the variables) and then resume execution 
at the line on which the function at the top of the state indicator 
is suspended. The RESTART function therefore checks the state 
indicator (via DLC) and either executes START if there is no 
suspension or performs restart logic. The final task performed by 
RESTART 1s to explicitly return the line number of the suspended 
function so that execution can resume. 


A typical RESTART function will have the following layout: 
CWSID: MSF] 
V R¢ERESTART 
Workspace driver function and line drop handler. 
Used as: 


OLX<'’>RESTART’ 


Return the line number of any suspended function 
(beyond RESTART): 


| oe | 

fh 

Lae 
>D>DDDD D 


C8] R¢e1lOLC 

[9] A Remove O's from any 2: 

C10] R€CR#0)/R 

C11] A Branch if restart logic is necessary: 

[12] 3(xeR)/L1 

C13] A Otherwise, run START and return R as an empty vector: 
C14] START 

C15] 0 

[16] A Restart logic. Display any messages. For example: 
[17] L1i:O¢'’ 

[18] O¢’EMPLOYEE MAINTENANCE SYSTEM BEING RESTARTED’ 

L219). He? 

[20] A Tie any files or share any variables. For example: 
[21] 'EMPDATE’ UFTIE fEMP 

[22] A Upon exit, line number result will be branched to 
[23] aA Cif OLX¢’»RESTART’) and execution will resume at 
[24] A point of suspension. 
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PROBLEM: Write a niladic function QDOC (quick documentation) which 
will display the contents of each function in the 
workspace, as if you typed Vfnname[0O1V for each function, 
in alphabetic order. 


TOPIC: Function Documentation 


The first step is to determine, under program control, which 
functions exist in the workspace. The system function ONL (name 
list) can be used to this end. When used monadically with the right 
argument 3 (for functions), ONL returns a character matrix of the 
names of the functions in the active workspace, one row per 

function. If your implementation of ONL does not return the names in 
sorted order, sort them (see the Sorting and Searching chapter). 


(Some APL implementations have different system functions for 
returning the names of identifiers. For example, APL*PLUS has 
OIDLIST and SHARP APL has 1 OWS. However, these implementations also 
support ONL.) 


The next step is to display the functions, under program control, one 
at a time. Since APL systems generally do not support an expression 
like ¢'Vfnname[0O1]V', a system function must be used. The APL*PLUS 
system function OVR (visual representation) and the SHARP APL system 
function 1 OFD (function definition) both return a character vector 
"visual representation" of the function whose name is provided as the 
right argument to the system function. The result, when displayed, 
looks exactly like the display produced by Vfnname[01V. This is 
possible because the character vector result contains newline 
(carriage return) characters at the end of each function line 
substring. For example: 


V UNLESS(COI1V 
V R€ELINE UNLESS CONDITION 
C1] R¢CONDITIONI LINE 


CV¢OVR 'UNLESS' 
eCV 
59 
CV 
V R¢ELINE UNLESS CONDITION 
C1] R¢CONDITIONI LINE 


V 
A Replace newline characters by '®’ to see them: 
CVL (CV=OTCNL)/teCVle'®@! 
CV 
V R¢LINE UNLESS CONDITION®@L£ 1] R¢CONDITION! LINE® V® 
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The result of OVR is an empty character vector if the function 
specified is locked. For implementations which provide such visual 
representation system functions, the solution to this problem is 
straightforward: 


CWSID: QDOC] 

V QDOC;3FNS;I;sN;sV;0I0 
[1] a Displays continuous listing of all functions in ws. 
[2] aA Origin 1: 
C3] OTO¢+1 
[4] a Determine functions: 
[5] FNS¢ONL 3 
[6] aA Exclude QDOC from list: 
[7] FNS€CFNSV.#CILo FNS) T’QDOC' )JAFNS 
C8] A FNS€CA/FNSV.74802,1LloeFNS)T2 So'QDOC CRAVR')/FNS aA APL2 
[9] a Sort fn names if not already: FNS¢FNS([OAV4ENS;] 
[10] A Loop on rows of FNS: 
C11] L¢0 
[12] N¢l1toFNS 
[13] LOOP: 9(N<I¢I+1)/0 
[14] V<DVR FNSECI3;] A APL*PLUS 
C15] A Vel OFD FNSCI3;] A SHARP APL 
[16] A V€CRAVR OCR FNSCI;] A APL2 
C17] aA Ignore if function locked: 
[18] 3€xpeV)JLOOP 


C19] O¢Vv 
[20] a Blank line: 
[21] O¢'!' 
[22] ~>LOOP 
V 


Please note that QDOC will not list functions in the workspace whose 
names happen to be the same as any of the local identifiers in QDOC 
(e.g. FNS or LOOP). The system function ONL 3 returns the names of 
identifiers which are functions at the most local level. Since FNS 
is a variable and LOOP is a label at the local level, any global 
function with the same name is "shadowed" and will not be seen. 
Likewise, OVR (or 1 OFD) returns only the visual representation of 
identifiers which are interpreted as functions at the local level. 


For implementations which do not provide a visual representation 
system function, you must work with the "canonical" (matrix) 
representation system function OCR (available in SHARP APL as 2 

OFD). The function OCR returns a character matrix representation of 
the function whose name is provided as the right argument. The 
result has one row per function line Cincluding the header) and as 
many columns as the length of the longest function line. The 
function lines are not numbered, are left justified within their 
respective rows and are padded to the right with blanks. For example: 
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CM¢OCR 'UNLESS' 

oe CM 
2 23 

CM 
R¢LINE UNLESS CONDITION 
R¢CONDITION! LINE 


The result of OCR is an empty character matrix if the function 
specified is locked. 


To write QDOC, we need a function which will convert this canonical 
representation result to the more aesthetic visual representation 
form. The following function will do the trick: 


[WSID: FNREP] 
V VR¢CRAVR CR;OI0;C;3;D;KEEP;L3;N;TCNL 

[1] a Converts canonical representation of fn to visual 
[2] A representation. Return empty vector if CR empty 
[3] aA Clocked fn): 
[4] VR¢e!'! 
C5] »(x/oCRILO 
[6] aA Use origin 1: 
[7] OTO<¢1 
[8] a Construct newline character: 
[9] TCNL¢COTCNL A APL*PLUS 
[10] A TCNL€EOTCL2] A APL2 
C11] Aa TCNL¢GAVEL157] Aa SHARP APL 
Ci2] A 
C13] aA Format header, deleting trailing blanks: 
C14] VR€CRI13] 
[15] VR¢' Vi',C+/V\' '#OVRIEVR 
[16] A Characters which may begin identifiers: 
C17} L¢’ ABCDEFGHIJKLMNOPORSTUVWXYZAabcdefghijklmnopgqrstuvwx 

yzaU' 
C18] aA First character in each line: 
C19] Cé¢CRI 31] 
[20] A Flag comment or label lines: 
[C21] De1llCC='A’ JvCCELIAV/CCR=!’: ' JA<\~\CREL, '0123456789' 
[22] A Drop header and include leading blank column: 
[23] CRe€1 ~1 -eCRITCR 
[24] A De-indent comment or label lines: 
[25] CR€EDOCR 
[26] a Number of lines: 
[271 N€1ToCR 
[28] A Line numbers right justified with right bracket: 
[29] LeCCC3fesN) ,0)S(N,1)etN),']! 
[30] A Line numbers left justified, with both brackets 
C31] A and newline: 
[32] L¢TCNL,'(’,CL+.=' 'J)OL 
[33] a Attach line numbers to lines: 
[34] CReL,CR 
[35] a Flag trailing blanks to drop from each line: 
C36] KEEP¢@v\' '#0CR 
C37] A Squeeze out trailing blanks: 
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V CRAVR Ccontinued) 
C38] CR¢( ,KEEP)/,CR 
(39] A Include header and trailer: 
[40] VR¢EVR,CR,TCNL,’ Vi ,TCNL 
V 


Given the CRAVR function, the QDOC function can be rewritten for OCR 
implementations by replacing the lines: 


FNS¢CFNSV.#CILoFNS)T'QDOC’IFEFNS 
VeOVR FNS[I;] 


in the QDOC function above by the corresponding lines: 


FNS¢CA/FNSV.7&(02,1loFNS)JT2 50'’QDOC CRAVR’' J)4FNS 
VeECRAVR OCR FNSCE I; J 
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PROBLEM: Design a function WSDOC (workspace documentation) which will 
display the entire contents of the workspace. 


TOPIC: Workspace Documentation 


One possible solution to this problem is to design a single 
self-contained function WSDOC which may be copied into the workspace 
to be documented. Since the function is self-contained, it requires 
no subfunctions. Therefore, it may be copied into the workspace or 
erased from the workspace with minimal impact. 


Let's establish the differences between the proposed WSDOC function 
and the QDOC function of the previous section: 


1. The output of WSDOC is paged, not continuous. 


2. WSDOC is monadic. The elements of its integer vector argument 
represent: number of rows per page (Cusually 66), number of 
columns per page (say 85), lines in top margin (say 3), lines 
in bottom margin (say 3), columns in left margin (say 5), 
columns in right margin (say 5). 
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6. 


At the top of each page is a title which includes the workspace 
ID, the current date and time and the page number. For example: 


36150 MODEL * 11/15/1986 15:43 PAGE 4 


The nondefault workspace environment is included at the top of 
the first page. The workspace environment includes the latent 
expression, the index origin, the print precision, the random 
link, the comparison tolerance and any other programmer- 
controlled workspace settings. Only those settings are 
displayed whose current values differ from those in a clear 
workspace. For example: 


NONDEFAULT WORKSPACE ENVIRONMENT: 


OLX¢'START'’ 
OPP¢12 


After the nondefault workspace environment, the global 
workspace variables are listed in alphabetic order, along with 
their shapes and up to one line of their raveled values. For 
example: 


GLOBAL WORKSPACE VARIABLES: 


NAME ¢ SHAPE e VALUE 


CODE ¢« 'X! 
MONTHS ¢ 12 9 9 ‘JANUARY FEBRUARY MARCH...’ 
MSG ¢ 10 9 'THAT''S ALL’ 
TABLE ¢ 2 99 15 9 1.016283 1.11984 1.61582... 
TIE ¢ 368 


After the global workspace variables, the function names are 
listed in alphabetic order. For example: 


FUNCTIONS: 
ADDEMP EMPLOYEES NINPUT UNLESS 
CATEMP IF RCAT 
CINPUT LISTEMP SELECT 
DELEMP MESSAGE SQZEMP 


Finally, the lines of each function are displayed as in Qpoc. 
However, function lines which are too long (for the page width) 
are broken into multiple lines with care taken not to break a 
line in the middle of an identifier or numeric constant. If a 
function will not fit on the remainder of a page, it is started 
on the top of the next page. Functions which are longer than 
one page are broken into multiple pages with care taken not to 
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display a long line on more than one page. At the bottom right 
corner of each page is a footnote which displays the first and 
last functions included on the page. For example: 


IF > SELECT 


8. All local variables and labels within WSDOC are prefixed by AA 
to minimize the number of variables and functions which are not 
recognized because of shadowing. 


Let's list the possible problems we may encounter when using such a 
self-contained WSDOC function: 


1. A WS FULL error may occur if there is insufficient available 
workspace to copy WSDOC. This problem is greater in 
implementations of APL (such as APL2) in which a function is 
copied by moving its canonical (matrix) representation. The 
Canonical representation of WSDOC is quite large if it has a 
lengthy header. To get around this problem, you may remove all 
local variables from the header and erase all variables 
beginning with 'AA'’' on the last line of the function. 
Alternately, you may load the WSDOC workspace and copy the 
workspace to be documented. However, bear in mind that the 
nondefault system variables will not be copied. 


2. A SYMBOL TABLE FULL error may occur if there are insufficient 
available entries in the symbol table for the local variables 
and labels in WSDOC. 


3. Another object which happens to be named WSDOC will be erased 
and replaced by the WSDOC function when it is copied into the 
workspace. 


Tf your APL implementation does not support a visual representation 
system function (e.g. OVR or 1 OFD), you will also need to copy in 

the function CRAVR. In this event, WSDOC requires CRAVR and is not 
strictly self-contained. 


The writing of WSDOC is left as an exercise at the end of the chapter. 


The use of this WSDOC function is a simple way to get a neat and 
thorough listing of the contents of your workspace. If your 
workspace documentation requirements go beyond the capabilities of 
this function, you may want to acquire a more comprehensive workspace 
documentation software package available from your APL vendor. Such 
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packages typically include capabilities for listing cross-reference 
information. For example, you can list the functions in which each 
workspace identifier is used, or list the identifiers used within 
each function, or display a "tree" diagram which shows which 
functions are called by other functions, and so on. 


PROBLEM: A function named IDENTIFY analyzes the visual representation 
of a function to determine which identifiers are used 
within the function and how. It then displays any known 
or potential errors or inconsistencies (e.g. assigning a 
value to a name which is also used as a label). Make a list 
of all such errors or inconsistencies. 


TOPIC: Function Identifiers 


The IDENTIFY function as described above is useful for a final 
validation on any function you have written, especially a large 
function. After using IDENTIFY, it is a simple matter to edit the 
function to correct the reported problems. 


Here are the problems and illustrations: 


1. Redundant label. 
C3] L6:A€¢35 


C17] L6:Q¢Bx*2 


2. Unused identifier localized. 
VY MODEL;A 
CA is not mentioned anywhere in MODEL, though it may be 


used within a character constant argument to ¢# or ina 
subfunction called by MODEL) 
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3. Identifier localized but not assigned. 


V MODEL;A 


[9] Q<ALK 
(A is not directly assigned, e.g. A¢B+2, anywhere in MODEL, 


though it may be assigned within a character constant 
argument to ¢@ or in a subfunction called by MODEL) 


4. Redundant local variable. 
V R€¢MODEL A;B;R 


or 
V MODEL: A;B;A3;R 


5. Localized label. 
V MODEL; LOOP;I 


C8] LOOP: »CLIM<I)/END 


6. Unused label. 


VY MODEL 


C6] L4:Ke2+B 
(No reference is made to L4, e.g. ®L4 or 9(L3,L4,L5)[1I], 
anywhere in MODEL, though it may be used within a character 


constant (e.g. #(T>0)/'»L4') or may be...gasp...referenced 
in a subfunction called by MODEL) 


7. Assigned label. 
V MODEL 
C3] END€7 99 


C25] END: 0¢V 


8. Identifier assigned but not localized. 


VY MODEL;A;J 


C7] BetoJd 


(B is not localized in MODEL, though it may be localized in 
a function which calls MODEL) 
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9. Identifier used and not localized. 
VY MODEL;A 
[6] A¢2+B 
(B is not localized in MODEL, though it may be localized in 


a function which calls MODEL or it may be a subfunction 
required by MODEL) 


10. Result not assigned. 
V REMODEL PARAMS 


(CR is not assigned in MODEL, though it may be assigned 
within a character constant to 2 or ina subfunction called 
by MODEL) 


11. Argument not used. 
VY R¢MODEL PARAMS 


(PARAMS is not used in MODEL, though it may be used within 
a character constant argument to ¢ or in a subfunction 
called by MODEL) 


The task of writing the IDENTIFY function is beyond the scope of this 
chapter. It is included as a problem in the Boolean Techniques 
chapter. IDENTIFY is monadic. Its right argument is the visual 
representation of the function to be analyzed. For example, to 
analyze the function MODEL, do the following: 


IDENTIFY OVR 'MODEL' in APL*PLUS 
IDENTIFY 1 OFD ‘MODEL’ in SHARP APL 
IDENTIFY CRAVR OCR 'MODEL'’ in another APL system 


CCRAVR is defined earlier in this chapter.) 


There are three related functions also developed in the Boolean 
Techniques chapter which warrant mention here. They are: RELABEL, 
LOCALIZE and UNCOMMENT. The right argument of each function, like 
IDENTIFY, is the visual representation of a function. The result of 
each function is a modified version of the visual representation, 
modified to accomplish a particular task. The functions are 
described below. 
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Syntax: NEWVR¢LABLIST RELABEL OLDVR 


The RELABEL function changes the labels in the visual representation 
so that they become Ll, L2, L3 and so on. In many functions, 
expecially large ones, labels serve simply as branch targets for 
downward flowing logic. It is difficult and pointless to think up a 
meaningful name for each label. It is more convenient to the reader 
to have the labels sequentially numbered so that they can be quickly 
located. Some labels, however, are best left as meaningful names 
(e.g. LOOP, END, START, CALC). The left argument LABLIST is a 
character vector of the names of the labels (separated by spaces) 
which are not to be renamed. Provide an empty character vector left 
argument (i.e. '’') if all labels are to be renamed. 


Do not use RELABEL on any function which contains local variables Ll, 
L2, and so on. Otherwise, these names will refer to both labels and 
local variables. The resulting function will no longer work 
correctly. 


RELABEL ignores all identifiers within quotes so some labels may not 
be modified as desired. For example, in the expression, 


ELX¢'>BELOW’ 


the reference to the label BELOW will not be detected and modified. 
To handle this potential problem, you may choose to write such 
expressions in the following way: 


ELX¢’'>' , 5BELOW 


Likewise, the names of labels included in comments are not detected 
by RELABEL. You should avoid placing labels in comments. For 
example, 


use: A Branch if quota exceeded 
not: Aa Go to L17 if quota exceeded 


RELABEL does not correct any of the problems with labels listed by 
IDENTIFY. 


Syntax: NEWVR¢VARLIST LOCALIZE OLDVR 


The LOCALIZE function changes the local variables in the header of 
the visual representation so that the header includes only those 
variables which are assigned within the visual representation. The 
LOCALIZE function tends to correct problems 2, 3, 4, 5 and 8 listed 
by IDENTIFY. Some variables, however, are assigned within a function 
but should be left global or are not assigned (i.e. are assigned in 
subfunctions) but should be localized. The left argument VARLIST is 
a character vector of the names of variables (separated by spaces) 
which are to be included in the header if not assigned or are to be 
excluded from the header if assigned. Provide an empty character 
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vector left argument (i.e. '') if all and only the assigned variables 
are to be localized. The localized variables will be included in the 
header in alphabetic order. 


Syntax: NEWVR¢UNCOMMENT OLDVR 


The UNCOMMENT function removes all comments from the visual 
representation. End-of-line comments are removed completely, 
including the comment symbol (A). The comment symbols which precede 
full-line comments are not deleted so that all function lines remain 
and do not renumber. Strangely, the UNCOMMENT function allows you to 
include more comments in functions you write. One argument for 
omitting or skimping on comments is that comments use up valuable 
workspace. The UNCOMMENT function allows you to write one set of 
functions which contain extensive comments (the "maintenance 
version") and another set which is functionally equivalent but 
contains no comments (the "production version"). 


Since the functions RELABEL, LOCALIZE and UNCOMMENT each require a 
Visual representation right argument and each return a visual 
representation result, they may be "chained" together to perform 
several functions at once. For example: 


NEWVR¢' LOOP’ RELABEL '' LOCALIZE UNCOMMENT OVR '/MODEL’ 


However, the visual representation of a function is of little value 
to you unless you can convert it back into a function. Some APL 
systems have a system function which will do this directly (QODEF in 
APL*PLUS and 3 OFD in SHARP APL). The right argument of the system 
function is the visual representation of a function and the result is 
the character vector name of the function defined. 


Therefore, to relabel a function named MODEL: 


N¢ODEF '’ RELABEL OVR ’/MODEL’ in APL*PLUS 
N¢3 OFD '’ RELABEL 1 OFD ‘/MODEL' in SHARP APL 


For APL implementations which do not have such a system function, you 
must use the system function OFX (fix). The right argument to OFX is 
the canonical (1.e. matrix) representation of a function (as returned 
by OCR) and the result is the character vector name of the function 
defined (fixed). 


Our task then is to write a function VRACR which will convert the 
visual representation result of RELABEL, LOCALIZE or UNCOMMENT into a 
canonical representation so that the function may be defined via 

OFX. Given the VRACR function, we may relabel a function named MODEL 
as follows: 


NeQFX VRACR '' RELABEL CRAVR UCR 'MODEL'’ 
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The following VRACR function will perform the necessary conversion. 


CWSID: FNREP] 

V CREVRACR VR;SOI0O3;B3C3;D3;1;LEN;NL;3R;TCNL 
[1] a Converts visual representation of fn to canonical 
[2] A representation. Return empty matrix if VR empty 
[3] aA Clocked fn): 
[4] CRE 0 O o'! 
C5] >+CeVR)LO 
[6] aA Use origin 1: 
C7 J OlO¢+1 
[8] aA Construct newline character: 
C9] TCNL¢OUTCNL A APL*PLUS 
[10] A TCNL€OTCEL2] aA APL2 
[11] A TCNL€¢QAVEO157] A SHARP APL 
[12] A 
[13] A Select header line (less newline): 
[14] CReCI€°1+VRiLTCNL)oeVR 
[15] A Drop off header: 
[16] VR€ILVR 
[171 A Delete leading V and spaces from header: 
[18] CR€eC+/aA\CRE’' V'IJICR 
[19] A Locate newlines which precede and follow 
[20] A each line: 
[21] NL€¢VR=TCNL 
[22] a Flag starts and ends of contiguous digits 
C23] aA (e.g. line no.s): 
[24] DeVRE'0123456789' 
[25] DeD#(eD)e0,D 
[26] A Flag char following '1]' after line no.: 
[27] De 1¢0D\"-10D/ “2ONL 
[28] A Flag starts and ends of contiguous blanks 
[29] a Ce.g. after line no.s): 
[30] B€eVR=' ' 
[31] Be¢B#(eB)00,B 
[32] A Flag first nonblank char in each line, as indices: 
C33] DeCD>BIVB\'-10B/D 
[34] D¢D/rieD 
[35] A Compute lengths of lines: 
[36] LEN¢C1il 1INL/iLeNLI-D 
[37] A No. of columns in result: 
[38] CéeCeCRITT/LEN 
[39] A No. of rows in result: 
[40] Re1+oLEN 
[41] aA Initialize result as raveled matrix: 
C42] CRECRxC)TCR 
[43] a Construct index vector I€(tLEN[1]),(LLEN[21),...: 
[44] J«¢LEN/-"110,+\LEN 
[45] I€I+voI 
C46] aA Insert fn lines into raveled result: 
[47] CRLI+LEN/CXlLpLENI¢VRLI+LEN/~1+D) 
[48] A Reshape result to matrix: 
[49] CRE(R,C)eCR 

V 
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This function borrows a number of the techniques discussed in the 
Boolean Techniques chapter. See that chapter for clarification. 


PROBLEM: Design a monadic function USEDBY whose argument is a list of 
functions (character matrix with one name per row or a 
character vector with names delimited by spaces) and which 
shows all subfunctions and global variables required by 
those functions. 


TOPIC: Workspace Identifiers 


When you inherit the maintenance of an APL application, there are 
three pieces of documentation which are invaluable to your 
comprehension of the system. They are: 


1. Function listings. If missing, you can reconstruct them by 
using the WSDOC function defined in this chapter. 


2. File structure documentation. If missing, you can hopefully 
reconstruct it by displaying the data from the files and by 
inferring meaning from the context in which the files are used 
(by reading the function listings). 


3. System flow charts. If missing, you can hopefully reconstruct 


them by running the USEDBY function on the user level functions 
and by reading the function listings. 


The following is an illustration of USEDBY on the EMPLOYEES function 
of the MSF workspace listed earlier in this chapter. 


~200- 


Chapter 13 WORKSPACE DESIGN AND DOCUMENTATION 


USEDBY 'EMPLOYEES ' 
EMPLOYEES 


UNLESS 


MESSAGE 
UNLESS 
IF 
UNLESS 
ENUM (global) 
CINPUT 
MESSAGE 
UNLESS 
CATEMP 
ENUM (global) 
NUM CADDEMP - local) 
ENAME (global) 
RCAT 
NAME CADDEMP - local) 
EAGE (global) 
AGE CADDEMP - local) 
DELEMP 
IF 
ENUM (global) 
SQZEMP 
ENUM (global) 
ENAME (global) 
EAGE (global) 
LISTEMP 
ENUM Cglobal) 
EAGE (global) 
ENAME (global) 


The USEDBY function does pretty much what you would do to manually 
diagram the subfunction and global variable structure of a system. 

It starts by evaluating the visual representation of the highest 
level function for global identifiers referenced (i.e. all 
identifiers but labels, results, arguments or localized variables). 
For those global identifiers which are themselves functions, it 
evaluates each one in the same fashion. And so on it recurses deeper 
or less deep in the fashion of the state indicator during execution 
of the system. 


The writing of USEDBY is left as an exercise at the end of the 
chapter. 


The USEDBY function is extremely powerful and quite complex. It 
draws heavily on the materials presented in the Boolean Techniques 
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chapter. It may help you to read that chapter before writing the 
function or reviewing the solution. 
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PROBLEMS: (Solutions on pages 367 to 382) 


1. Design other "visual representation manipulation" functions which 
may be useful. Pattern their syntax and behavior after the 
IDENTIFY, RELABEL, LOCALIZE and UNCOMMENT functions described in 
this chapter. 


2. Sketch a flowchart of the WSDOC function described in this 
chapter. Compare it to the flow of the WSDOC function listed in 
the solutions at the back of the book. 


3. Sketch a flowchart of the USEDBY function described in this 
chapter. Compare it to the USEDBY function listed at the back of 
the book. 
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FILE DESIGN AND UTILITIES 


When APL was first implemented, the language included no file 
capabilities. This shortcoming was quickly recognized as an obstacle 
to the acceptance of APL as a viable business programming language. 
Two approaches were taken to overcome the obstacle. 


In one approach (shared variables), facilities were developed (USVO, 
OSVR, ...) to provide access to existing non-APL file structures. 
From APL, you can do anything with files that you can do from another 
programming language. While this approach enables the APL user to 
communicate with non-APL environments, it leaves the APL purist 
unsatisfied. It is difficult and disappointing to the APL programmer 
to work with the concise and consistent APL primitive functions on 
the one hand and the messy world of records, tracks, blocks, 
cylinders and disks on the other hand. 


In the other approach (shared files), facilities were developed 
(OFCREATE, OFREAD, ...) to provide access to APL file structures. [In 
the spirit of APL simplicity, an APL file was defined as a list of 
APL objects residing outside of the active workspace. A file can 
have any number of objects (called "components") and each object can 
be of any type, rank or shape. The components are numbered 
consecutively from 1. A large object can replace a small object 
without the APL programmer knowing or caring how the storage is being 
managed on the physical storage device. 


If you want to work with APL files but your APL implementation 
supports only shared variables, look for a public library workspace 
which uses shared variables to emulate APL files (e.g. the IBM 
workspace 2 VAPLFILE). 


In this chapter, we will discuss some of the more common APL file 
organizations and the trade-offs between them. We will also discuss 
the value of quality file documentation and file utility functions. 
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PROBLEM: Suppose you want to build an APL system for maintaining a 
database of information about insurance policyholders. For 
each policyholder, you will keep track of: policy number, 
issue age, issue date, sex, classification and face 
amount. Design a file organization for this application. 


TOPIC: APL Database File Organization 


The ideal file organization for a given application depends upon the 

size of the database and upon how it is used. Since this information 
is missing in the description of the problem, we will present several 
alternative organizations. In the next section, we will discuss the 

factors to consider when deciding among these organizations. 


For simplicity, we will assume that all of the policyholder 
information may be expressed as numbers (e.g. sex as 0 or 1). We will 
refer to the items by the abbreviations:  POLNO, IAGE, IDATE, SEX, 
CLASS, AMT. 


The descriptions of 8 alternative APL file organizations follow: 


1. Record Oriented 


Comp 

No. Description 
z POLNO, IAGE, IDATE, SEX, CLASS, AMT for 1st policy 
2 f tt iB] f tt f te 2nd tt 
3 ? *! ? tt tf tf tt 3rd tf 

2. Record Oriented with Deletion Flag 

Comp. 

No. Description 
1 STATUS, POLNO, IAGE, IDATE, SEX, CLASS, AMT for lst policy 

(STATUS=1 if active record, 0 if "deleted') 

2 ee tt tf *? t? f a] t 2nd +f 
3 tf 't ? ? #t ? tt ? 3rd tt 
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3. Directory 


Comp 
No. Description 
1 Numeric vector (directory) of POLNO for every policy 
2 IAGE, IDATE, SEX, CLASS, AMT for lst policy (whose 
POLNO is the lst element of the directory) 
3 tt '" 't tt ot for 2nd policy tt ! 
I tf ft tf t? tf tf (T-lL)th ft ft tf 


4. Directory with Deletion Flag 


Comp. 
No. Description 
1 Boolean vector of STATUS for every policy (STATUS=1 if 
active record, 0 if '"deleted") 
2 Numeric vector (directory) of POLNO for every policy 
3 IAGE, IDATE, SEX, CLASS, AMT for 1st policy (whose 
POLNO is the 1st element of the directory) 
4 ? tt tt tt ? for 2nd policy tt tt " 
I ¢ f rf 'f tt (T-2 Ind tf f ‘? tt 


5. Transposed 


Comp. 
No. Description 
1 Numeric vector of POLNO for every policy 
2 a] tt a LAGE 9 ? tf 
3 ft f ft IDATE vf ‘? tf 
4 ft ve +? SEX tf ve tt 
5 '? tt tf CLASS tf tt tt 
6 " A tf AMT +? tf t 
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6. Transposed with Deletion Flag 
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SOO PB WD 


FILE DESIGN AND UTILITIES 


Description 


een ee ee ee 


Boolean vector of STATUS for every policy 
(STATUS=1 if active record, 0 if "deleted") 


Numeric vector 


? 


¢ 


A 


tt 


rf 


tt 


7. Multi-Set Transposed 


(1+6xI-1) 
(2+6x1I-1) 


* 
e 


Numeric vector 


" 


Numeric vector 


" 


Numeric vector 


A 


‘4 


*! 


tt 


of 
¢ 


IAGE 
IDATE 
SEX 
CLASS 
AMT 


ft 


t 


of POLNO for every policy 


't 


Description 


— ee ee ee eee ee ee ee ee eee ea eee ie 


POLNO 
IAGE 
IDATE 
SEX 
CLASS 
AMT 


POLNO 
IAGE 


POLNO 
IAGE 


A 


for 2nd 2000 policies 
f 


" * 


for Ith 2000 policies 


9 
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8. Multi-Set Transposed with Deletion Flag 


Comp. 
No. Description 
1 Boolean vector of STATUS for ist 2000 policies 
(STATUS=1 if active record, 0 if "deleted") 
2 Numeric vector of POLNO for 1st 2000 policies 
3 ? tt tf LAGE tt ? tt tf 
4 e? ‘1 ? ITLDATE ‘ft +? tf tf 
5 tt ? ft SEX tt tt tf ‘f 
6 tf t tt CLASS ‘f tt tf A 
7 ! tf tf AMT +? tf tf +? 
8 Boolean vector of STATUS for 2nd 2000 policies 
9 Numeric vector of POLNO for 2nd 2000 policies 
10 ? ! f LAGE rt tf tf tt 
(1+7xI-1) Boolean vector of STATUS for Ith 2000 policies 
(2+7xI-1) Numeric vector of POLNO for Ith 2000 policies 
A] f ITAGE tf a *? ‘f 


(3+7xI-1) ie 


This list of file organizations is not exhaustive. It merely 


illustrates some typical APL file organizations. 


At the two extremes are the record oriented and the transposed 
organizations. The directory organization is a hybrid of the two. 
The multi-set transposed organization is a modification of the 
transposed organization designed to avoid WS FULL errors when working 
with large databases. 


The "deletion flag" alternative exists for any file organization. 
When you need to delete records from a database, you have two 
alternatives: delete the record now (shifting other records if 
necessary to fill the void); or flag the record to be deleted but do 
not delete it until later (by a procedure which restructures the file 
to remove all flagged records or by the gradual process of replacing 
flagged records by new records as they are added). 
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PROBLEM: List the factors to consider when choosing among 
alternative APL file organizations. 


TOPIC: File Design Considerations 


To choose among file organizations, you must know how much 
information is to be stored and how it is to be used. For each type 
of task, consider how well each file organization will stand up to 
the demands made upon it. In particular, ask yourself: 


1. How may file accesses (i.e. read or write operations) will be 
required? These take time. 


2. How CPU efficient will the task be? Does the organization require 
significant amounts of processing? 


3. How efficient will the task be in terms of workspace storage? Are 
WS FULL errors likely? 


4. How complex is the file structure? Will the programs be difficult 
to write, to read and to debug? 


5. Is redundant file storage required? If so, might the file become 
excessively large? Could its data get out of synch? 


The following is a list of representative tasks which are performed 
on databases. When choosing a file organization, consider each 
task. Will this task be performed in this application? How often? 
How well is it performed in this file organization given the 
performance measures suggested above? 


Add 1 record 

Add 100 records 

Find/change 1 record (1 item) 
Find/change 1 record (all items) 
Find/change 100 records (1 item) 
Find/change 100 records (all items) 
Find/delete 1 record 

Find/delete 100 records 

Find/list 1 record (5 items) 

10. Find/list 1 record (all items) 
11. Find/list 100 records (5 items) 
12. Find/list 100 records (all items) 
13. Summarize all records (1 item) 
14. Summarize all records (10 items) 


ODN AM FP WNP 


The chart below rates the 8 file organizations presented in the last 
section for each of these 14 tasks. The letters A Cexcellent) to F 
(horrible) are used for rating. These ratings are subjective and 
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will vary from application to application but this chart is a good 
guideline. 


File Organization 


emer ere 


Add 1 record 

Add 100 records 

Change 1 record, 1 item 
Change 1 record, all items 
Change 100 records, 1 item 
Change 100 records, all items 
Delete 1 record 

Delete 100 records 

List 1 record, 5 items 

List 1 record, all items 

List 100 records, 5 items 
List 100 records, all items 
Summarize all records, 1 item 
Summarize all records, 10 items 


AAA AAA Aya PI re 
PARED eae PP in 
AAWOAPPrOWWOY,P PPP Iw 
AMD PrP PrP Pwd Pos! A 
PrPOWWOAWWDWDWWPrPwOPrrni! uo 
rPrPUMWOAWAPrrHnWWndWdWwo! 
PrPWWONOWWWWPwoPrPn i! 
PrPUWWOMPr POW WWWo! wo 


Several conclusions may be drawn from this chart: 


a 


If you intend to do much summarizing or cross-tabulating, you 
should choose a transposed file organization. 


Unless you intend to add records and do nothing else, you should 
avoid a record oriented file organization. 


No file organization is ideal for all tasks. The best file 
organization is frequently the one which has the fewest and least 
severe shortcomings rather than the most strengths. Sometimes a 
hybrid organization will be the best solution for a given 
application. 
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PROBLEM: For a 1,000,000 record APL database, it is critical that a 
specified record (e.g. policy) be located instantly. How 
would you organize the file? 


TOPIC: Efficient Record Location 


The record oriented file organization is out. We do not have the 
time to do up to 1,000,000 file read operations. Even the multi-set 
transposed file organization has problems. If blocked at 2000 
records per set of components, there will need to be up to 500 file 
read operations. That is fine for ad hoc file analyses but is 
unacceptable for instant access. 


To solve this problem, we need to utilize the information contained 
within the key value (record identifier) itself. For example, 
suppose our records are insurance policies and the record identifier 
1s a policy number. We must use a portion of that number to get us 
quickly to the vicinity in which the record is located. Consider the 
following "inverted" directory file: 


Comp. 
No. Description 
1 Two-row matrix with one column per policy whose policy 
number ends with 000: 
[1;] policy number 
[2;] record index where policy data are stored 
2 Ditto for policies ending 001 
3 Ditto for policies ending 002 
1000 Ditto for policies ending 999 


This file could be a companion file for any of the file organizations 
discussed above. The meaning of "record index" depends upon which 
file organization is used. For example, if the record oriented file 
organization is used, the record index can simply be the number of 
the component in which the record is stored. If a transposed file 
organization is used, the record index can be a number whose format 
is SSSSIIII where SSSS is the number (index) of the set of components 
in which the record resides and IIII is the exact index within the 
components of that set where the record is located. 


Given this directory file, any one of the 1,000,000 policies may be 


located with a single file read operation. For example, to find 
policy 613821904, read component 905 (1.e. 1+904) of the directory 
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file and search its first row for this number. The corresponding 
element of the second row contains the record index for the policy. 


The term "inverted" is used to refer to a file which stores record 
indices (Cor pointers) rather than data values. The trade-off for 
realizing such rapid record location is that the directory must be 
set up initially and must be updated as records are added or deleted 
Cor their policy numbers changed). This will slow down the record 
maintenance process somewhat and will make it more complex. 
Consequently, such a directory should be included only if essential. 


An alternative to the inverted file organization is the "layered" 
file organization. Suppose the file is layered by the last three 
digits of the policy number. Rather than maintaining a list of the 
record indices for each possible value (000 to 999), the records are 
physically segregated by the values. For example, all records whose 
policy number end with 904 are kept together on file. 


This "layering" is fairly easily accomplished with the multi-set 
transposed file organization. Each set of components contains 
records for only a single layer value. For example, the first set of 
components could contain the information for policies whose policy 
number ends with 625, the second set with 904, the third set with 
707, and so on. If there are more policies with numbers ending in 
904 than you can place in one set, use more than one set for the 
records with that layer value. 


Suppose you employ a multi-set transposed file organization blocked 
at 2000 (maximum) records per set of components and layered by the 
last three digits of the policy number to store the 1,000,000 records 
discussed above. On average, each set will contain 1000 records. 
Some more. Some less. If any layer value (e.g. 000) is so popular 
that it belongs to more than 2000 records, the records of that layer 
will occupy more than one set. A directory of layer values is 
maintained as a vector with one element per set and is stored as a 
single component of the file. 


Given this file organization, any one of the 1,000,000 policies may 
be located with 2 or 3 or so file read operations. For example, to 
find policy 613821904, read the layer values vector and search it for 
904. The matching element(s) identify the set(s) whose records have 
policy numbers ending with 904. Then, read the policy number 
component for that set Cor sets) and search for the policy number. 
The result is the index within the set at which that record is 
located. 
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PROBLEM: What should be included in written file documentation? 


TOPIC: File Documentation 


Since a file has no value except as employed in an application, its 
documentation should be couched in terms relevant to the 
application. For example, if the file is activated by "tying" it to 
an arbitrary number, show the tie number which is actually used in 
the application. 


When components are read into the active workspace from the file, 
they may technically be assigned to any variable name. Show the 
names which are actually used in the application. When describing a 
file component, indicate its shape and type and the significance of 
its value. 


The following is an illustration of proper file documentation. Do 
not waste your time studying its intricacies. Rather, use it to 


become comfortable with the general structure of good file 
documentation. 


FILE NAME: POLICY TIE NUMBER: 321 


DESCRIPTION: Contains policyholder information 


Comp 
No. VARIABLE DESCRIPTION 

1 TDATE Integer scalar of the transaction date 
CYYYYMMDD) when policyholder information 
was last added to the file from the 
administration system. 

2 CTYPES Integer vector of the available 
underwriting classification codes. 

3 CNAMES 10 column character matrix of brief names 
for the underwriting classes; the rows of 
CNAMES are in 1-to-1 correspondence with the 
elements of CTYPES. 

4 FIV Field identification vector. Integer vector 


with one element per field of information on 
file (e.g. policy number, issue age, sex, 
eee). The value indicates the type of array 


Heo 
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6 ARPS 

7 RPS 
8-10 Clatent) 


F+10+FP(3)]xS-1 


The fields of data are: 


FIELD 
NO. (CF) 


VARIABLE 
NAME 
STATUS 
POLNO 
IAGE 
IDATE 
SEX 
CLASS 
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used to store the information: 


10: Deletion flag Boolean vector (1: 
active record) 
11: Boolean vector 
12: Character vector 
13: Integer vector 
14: Floating point vector 
nn2: Character matrix with nn columns 


File parameters vector. 


FPL1]: Number of active records 

FP(2]: Number of records (including 
deleted) 

FP[(3]: Number of fields (i.e. oeFIV) 

FP£4]: Maximum number of records, per set 
of FP[3] components 

FP£5]: Number of sets of FPI[3] components 


Active records per set. Integer vector with 
one element per set (FP[5]) of the number of 
active (not deleted) records per set. Note: 
C+/ARPS)=FP[1] 


Records per set. Integer vector with one 
element per set (FP[5]) of the number of 
records (including deleted) per set. Note: 
(+/RPS)=FPC2] 


Empty numeric vector 


Array of data for field F (1 to FP{£3]) in 
set S (1 to FP[5]). The type and rank of 


this array is defined by FIVIF]. The length 
of its first dimension is RPS([S]. 
V DESCRIPTION 
Deletion flag (l=active; O=deleted) 
Policy number (up to 13 digits) 
Issue age (NN) 
Issue date (YYYYMMDD) 
Sex ('M' or 'F’ or ' ' if unknown) 
Underwriting classification code 
Can element of CTYPES) 
Face amount (cents) 
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If you inherit the maintenance responsibility for an application 
system which has no file documentation, your first task is to 
reconstruct the file documentation. This is usually possible by 
carefully reviewing the functions which access the files and by 
reviewing the file components themselves. To display the functions 
for your review, use the QDOC or WSDOC functions defined in the 
Workspace Design and Documentation chapter. 


To display the file components, use the FILEDOC function described 
here. The right argument is the same as that of WSDOC: page height, 
width, top margin, bottom margin, left margin, right margin (e.g. 66 
85 3 3 10 5). The left argument identifies the file to be documented 
(e.g. file tie number). The output is paged and looks like: 


FILE: 21368 POLICY (521 COMPONENTS) * 11/27/86 12:06 PAGE 1 


COMPONENT SHAPE 0 VALUE 


1 19861025 

2 11 op 31 32 33 34 41 42 42 51 52 53 99 

3 1110 © ‘STANDARD P38K4 P39K4 Pees: 

4 7 © 10 14 13 13 12 13 13 

5 5 eo 801625 801643 7 2000 £401 

6 401 © 2000 1998 2000 2000 2000 2000 1995 2000... 

7 401 9 2000 2000 2000 2000 2000 2000 2000 2000... 
8-10 0» 10 

11 2000 ep 13156281325 21065134890 21065338190... 


COMPONENTS 1 TO 67 


The writing of FILEDOC is left as an exercise at the end of the 
chapter. 
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PROBLEM: Design a set of utility functions for accessing APL files. 
The functions should be intuitive (i.e. be analagous to APL 
primitive functions) and should have a syntax which is 
independent of the chosen file organization and independent 
of your implementation of APL files. 


TOPIC: File Utility Functions 


By designing and using such a set of file utility functions, you can 
become more productive. The procedure for working with files becomes: 


1. Design the file organization for a given application; 
2. Implement these utility functions for the given file organization; 


3. Use the utility functions instead of primitive (e.g. OFREAD or 
OREAD) file access functions. 


By using a consistent set of utility functions, you can work on many 
application systems without having to continually reorient yourself 
to the different file organizations. In effect, the utility 
functions shelter you from the intricacies of each file organization. 


Below is a recommended set of file utility functions. In each 
function, the variable FP (file parameters) represents a numeric 
scalar or vector which distinctly identifies the file to be used 
(e.g. file tie number or tie number and blocking factors). If your 
application deals with just one file and its organization is 
sufficiently unusual that you are unlikely to need these functions 
for a similar file, you may omit the FP argument altogether. 


The philosophy behind the design of these file utility functions is 
to treat the file as a matrix in your workspace. The rows of the 
matrix are called "records". The columns are called "fields". Each 
utility function emulates some common matrix operation. For example, 
imagine the data stored in a workspace matrix named FILE. A common 
matrix operation is adding new records: 


FILE¢FILE, (1JINEWDATA 
The corresponding file utility function has the syntax: 
FP¢FP CATREC NEWDATA 


Along with the syntax of each utility function is listed the 
analogous APL expression for operating on a matrix named FILE. 


There are two important distinctions to keep in mind. One is that a 
field does not need to represent a vector of data. It may be a 
matrix. For example, a field of employee names may be stored on file 
as a 20 column character matrix. Here, we treat the names as a 
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Single field. Hence, the analogous APL expression will treat them as 
a single column. 


The second distinction is that "record indices" means values that are 
understood by the utility functions to identify particular records. 
They do not necessarily mean t indices. For example, for a multi-set 
transposed file organization, a single record index might be a two 
element vector whose first element is the index of the set and whose 
second element is the array index within the set. 


It is not expected that you will implement all of the following 
utility functions. Rather, you should select those most useful for 
the application and implement then. 


(STSC’s File Manager product -- originally marketed as "EMMA" -- 
provides a comprehensive set of such file utility functions for an 
APL*PLUS-based multi-set transposed file organization. ) 


SYNTAX: FP INITFILE FT 
FILE¢FPoFT 


INITFILE is used to build an "empty" file whose file parameters are 
FP and whose field types are defined in FT. 


SYNTAX: FP€FP CATREC MAT 
FILECFILE, [11IMAT 


CATREC is used to add (catenate) records to the end of the file. MAT 
is a matrix of information to be catenated (or inserted into records 
flagged for deletion). Each row of MAT represents a single new 
record. Each column of MAT corresponds to a single field, or column 
of a matrix field, of data (excluding the deletion flag field, if 
any). If MAT is a vector, it is treated like a one-row matrix. If a 
scalar or one-element array, it is catenated to the bottom of each 
field as a single record. The result is the modified value of FP. 


SYNTAX: FP¢€FP CATRECWS NREC 
FILE¢FILE,£1)]F1,F2,F3,... 


CATRECWS is used to add (catenate) records to the end of the file. 
NREC is an integer scalar whose value represents the number of 
records to be catenated (Cor inserted into records flagged for 
deletion). The data for these new records are located in the global 
field variables Fl, F2, F3, ... where Fl is a vector of values for 
the first field of the record (or a matrix with one row per record), 
F2 is for field 2, F3 is for field 3, and so on. One variable is 
required per field Cexcluding the deletion flag field, if any). 
Regardless of the magnitude of NREC, any field variable may be a 
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one-row matrix, a vector with one element per column of the matrix 
field, or a scalar or one-element array. The data will be reshaped 
and catenated to the bottom of the field for NREC records. The 
result is the modified value of FP. The field variables are erased 
upon successful completion of the function. 


SYNTAX: RINDS¢CFP,KFLD) IOTA VALUES 
RINDS¢FILEL ;KFLD]LVALUES 


IOTA searches through the file Cignoring deleted records) for the 
first occurrences of records whose key value (e.g. policy number, 
employee number, transaction number) 1s specified in VALUES. IOTA 
behaves like dyadic 1. That is, its result contains one index per 
value in the right argument, in 1-to-1 correspondence. The elements 
of RINDS are record indices which can be used to directly locate the 
records. The elements of RINDS are “1 for those elements of VALUES 
not found. The left argument of IOTA may be just FP if there is only 
one key Cidentifying) field. Otherwise, the number of the key field 
(KFLD) is included in the left argument. 


SYNTAX: RINDS©IOTARHO FP 
RINDS€¢L1ToFILE 


IOTARHO returns the record indices of all active (not deleted) 
records in the file. IOTARHO behaves like monadic to. That is, its 
result contains all of the indices for the specified array (file). 
The elements of RINDS are record indices which can be used to 
directly locate the records. 


SYNTAX: RINDS¢SVEC SLASHIOTARHO FP,SFLDS 
RINDS¢€C@SVEC)/L1ITe FILE 


SLASHIOTARHO returns the record indices of all active (not deleted) 
records in the file which satisfy a specified criterion. SVEC is a 
character vector APL expression (e.g. '(F3>50)AF2#0’) which defines 
the desired criterion. The expression is stated in terms of field 
variables Fl, F2, F3, ... which represent the data stored in the ist, 
2nd, 3rd, ... fields of each record. The expression should be 
constructed such that when executed, its result is a Boolean vector 
with one element per record (i.e. per element or row of Fl, F2, F3, 
...) in which ones marks records selected. SLASHIOTARHO behaves like 
dyadic /.p. That is, its result contains all of the indices for the 
specified array (file) which satisfy the specified criterion. The 
elements of RINDS are record indices which can be used to directly 
locate the records. The right argument (beyond FP) is an integer 
vector of the indices of the field variables to be constructed before 
executing the expression (e.g. 2 3 for 'CF3>50)AF2#40’ or 1 3 7 for 
'O<PROCESS F3' where Fl and F7 are required by PROCESS as global 
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variables). If SVEC and SFLDS are empty, SLASHIOTARHO returns the 
record indices of all active records in the file. 


SYNTAX:  FP¢FP DELREC RINDS 
FILE€(~CLITe FILE) €RINDS)/FILE 


DELREC deletes specified records from the file. The elements of 
RINDS are the indices of the records to be deleted (as returned by 
IOTA, IOTARHO or SLASHIOTARHO). The result is the modified value of 
FP (if modified). 


SYNTAX: FP¢SVEC COMPRESS FP,SFLDS 
FILE¢ (@SVEC) “FILE 


COMPRESS deletes all records in the file which do not satisfy a 
specifed set of criteria. 


FP¢SVEC COMPRESS FP,SFLDS 
has the same effect as 
FP¢FP DELREC ('~!',SVEC) SLASHIOTARHO FP,SFLDS 


Notice that the former expression does not need to construct 
intermediate record indices for the selected records and so is more 
efficient than the latter expression. If SVEC and SFLDS are empty, 
no records are deleted. 


SYNTAX: MAT¢RINDS INDEX FP,FLDS 
MAT¢FILECRINDS ; FLDS ] 


INDEX is used to retrieve from file the data (MAT) from selected 
fields (FLDS) for specified records (RINDS). The elements of RINDS 
are the indices of the records to be retrieved (as returned by IOTA, 
IOTARHO or SLASHIOTARHO). The elements of FLDS are indices of the 
fields to be retrieved. The result is a matrix of the retrieved data 
with one row per element of RINDS and one column per element of FLDS. 


SYNTAX: RINDS INDEXWS FP,FLDS 
FI¢FILECRINDS;1] © F2¢FILECRINDS;21 © ... 


INDEXWS is used to retrieve from file the data from selected fields 

(FLDS) for specified records (RINDS). The elements of RINDS are the 
indices of the records to be retrieved (as returned by IOTA, IOTARHO 
or SLASHIOTARHO). The elements of FLDS are indices of the fields to 
be retrieved. The retrieved data are assigned to global variables 
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named Fn where n is the number of the field retrieved (e.g. F3 and F7 
for FLDS<3 7). The global variables are vectors with one element Cor 
matrices with one row) per element of RINDS. 


SYNTAX: MAT¢SVEC SELECT FP,FLDS,0,SFLDS 
MAT<( @¢SVEC)/FILE[ ;FLDS] 


SELECT is used to retrieve from file the data (MAT) from selected 
fields (FLDS) for all active (not deleted) records in the file which 
satisfy a specified set of criteria (SVEC). 


MAT¢+SVEC SELECT FP,FLDS,0,SFLDS 
has the same effect as 
MAT€(SVEC SLASHIOTARHO FP,SFLDS) INDEX FP,FLDS 


Notice that the former expression does not need to construct 
intermediate record indices for the selected records and so is more 
efficient than the latter expression. If SVEC and SFLDS are empty, 
SELECT retrieves data for all active records in the file. 


SYNTAX: SVEC SELECTWS FP,FLDS,0,SFLDS 
FICC @SVEC)ZFILEL31] © F2¢€C@SVECIJZFILEL3;2] Oo ... 


SELECTWS is used to retrieve from file the data from selected fields 
(FLDS) for all active (not deleted) records in the file which satisfy 
a specified set of criteria (SVEC). 


SVEC SELECTWS FP,FLDS,0,SFLDS 
has the same effect as 
(SVEC SLASHIOTARHO FP,SFLDS) INDEXWS FP,FLDS 


Notice that the former expression does not need to construct 
intermediate record indices for the selected records and so is more 
efficient than the latter expression. If SVEC and SFLDS are empty, 
SELECTWS retrieves data for all active records in the file. 


SYNTAX: FP€¢RINDS INDEXA (FP,FLDS) ASSIGN MAT 
FILECRINDS ; FLDS J<MAT 


INDEXA is used to replace on file the data in selected fields (FLDS) 
for specified records (RINDS). The elements of RINDS are the indices 
of the records to be replaced (as returned by IOTA, IOTARHO or 
SLASHTOTARHO). The elements of FLDS are indices of the fields to be 
replaced. MAT is a matrix of the data to be replaced with one row 
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per element of RINDS and one column per field (or per column of a 
matrix field) identified in FLDS. Mat may be a vector if FLDS 
identifies a single vector field. Regardless of the number of 
records identified by RINDS, MAT may be a one-row matrix or vector 
with one element per column of the fields, or a scalar or one-element 
array. The data will be reshaped and assigned to the specified 
records. The result is the modified value of FP. The ASSIGN 
function simply assigns its right argument to <assign> and returns 
its left argument. INDEXA erases the variable <assign> when done 
with it. 


SYNTAX: FP€RINDS INDEXWSA FP,FLDS 
FILECRINDS;1]¢F1 © FILECRINDS;21]¢F2 © 


INDEXWSA is used to replace on file the data in selected fields 
(FLDS) for specified records (RINDS). The elements of RINDS are the 
indices of the records to be replaced (as returned by IOTA, IOTARHO 
or SLASHIOTARHO). The elements of FLDS are indices of the fields to 
be replaced. The replaced data is taken from global field variables 
named Fn where n is the number of the field replaced (e.g. F3 and F7 
for FLDS¢3 7). The global variables are vectors with one element (or 
matrices with one row) per element of RINDS. If the field variable 
is a one-row matrix or a vector with one element per column of the 
matrix field, it will be applied across all records identified by 
RINDS. If the field variable is a scalar or one-element array, it 
will be applied across all records and all columns of the field. The 
result is the modified value of FP. The field variables are erased 
upon successful completion of the function. 


SYNTAX: FP¢CFP,XFLDS) EXECUTE XVEC 
@XVEC 


EXECUTE is used to execute a specified (XVEC) character vector APL 
expression (e.g. 'SUM¢SUM++/F4'). The expression is stated in terms 
of field variables Fl, F2, F3, ... which represent the data stored in 
the lst, 2nd, 3rd, ... fields of each active (not deleted) record. 
The expression is executed once for each block of records on file 
(each active record in a record oriented file organization or each 
set of records in a multi-set transposed file organization). The 
left argument (beyond FP) is an integer vector of the indices of the 
field variables involved in the expression. Positive indices 
indicate fields to be read from file before execution of the 
expression; negative indices indicate fields to be replaced on the 
file after execution of the expression. The result is the modified 
value of FP. For example: 
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FP¢CFP,° 7 7 2 4) EXECUTE ‘'F7¢F7[F2+F4' 


SUM¢€O 
Aa XTAB is a function which assumes globals F3, F6, F9: 
FP€(FP,3 6 9) EXECUTE 'SUM¢SUM + XTAB' 


SYNTAX: FP¢CFP,XFLDS,0,SFLDS) EXECUTE XVEC FOR SVEC 
eXVEC 


In this context, EXECUTE is used to execute a specified (XVEC) 
character vector APL expression for only those active (not deleted) 
records which satisfy a specified set of criteria (SVEC). The FOR 
function simply catenates and returns its two character vector 
arguments, separating them by a newline character. Both expressions 
are stated in terms of field variables Fl, F2, F3, ... which 
represent the data stored in the lst, 2nd, 3rd, ... fields of each 
record. SVEC should be constructed such that, when executed, its 
result is a Boolean vector with one element per record (i.e. per 
element or row of Fl, F2, F3, ...) in which ones mark records to be 
selected for subsequent construction of the field variables to be 
included in the execution of XVEC. XFLDS and SFLDS are integer 
vectors of the indices of the field variables involved in the 
respective expressions XVEC and SVEC. Negative elements of XFLDS 
indicate fields to be replaced after execution of XVEC. The result 
is the modified value of FP. For example: 


FP¢C(FP, 7 7 2 4 0 3) EXECUTE 'F7¢F7I>F2+F4’ FOR 'F3>1' 


SYNTAX: RINDS€CFP,KFLD) IOTA VALUES LAYERS 2Z 
RINDS¢IOTARHO FP LAYERS 2Z 
RINDS€SVEC SLASHIOTARHO FP,SFLDS LAYERS Z 
FP¢SVEC COMPRESS FP,SFLDS LAYERS Z 
MAT¢SVEC SELECT FP,FLDS,0,SFLDS LAYERS Z 
SVEC SELECTWS FP,FLDS,0,SFLDS LAYERS Z 
FP¢CFP,XFLDS) EXECUTE XVEC LAYERS Z 
FP¢CFP,XFLDS,0,SFLDS) EXECUTE SVEC FOR SVEC LAYERS Z 


LAYERS can be used when the file is layered (see Efficient Record 
Location section in this chapter). When used, the scope of the file 
utility function is limited to just those records whose layer value 
is in the list of layer values Z. All other records are ignored. 

The LAYERS function simply assigns its right argument to <layers> and 
returns its left argument. The file utility function erases the 
variable <layers> when done with it. 
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PROBLEM: Design and document a precise file layout to implement the 
file utility functions of the previous section for a 
multi-set transposed file organization. Keep the layout 
general enough to allow a deletion flag or not, and to 
allow layers or not. 


TOPIC: Multi-Set Transposed File Organization 


Here is a possible file layout for a multi-set transposed file 
organization. A discussion follows it. 


FILE NAME: up to you TIE NUMBER: up to you 


DESCRIPTION: Multi-set transposed file with optional record 
deletion flags and optional layers. 


COMP. 
NO VARIABLE 
1 DOC 
2 FN 
3 FD 
4 FT 


5-6 (latent) 


DESCRIPTION 

Character matrix or vector (with embedded 
newlines) description of the purpose and 
contents of this file. [optional] 


Character matrix of abbreviated field names 
(as valid identifier names) with one left 
justified name per row. [optional] 


C1ITeFN)=CiloFT) 


Character matrix of full field descriptions 
with one description per row. [optional] 


(1TeFD)=(1loFT) 


Two row integer matrix of field type 
information with one column per field (FPI[3] 
columns). The meanings of the values are: 


FT[1;] Field width. If 0, the field is 
inactive (latent). If 1, the field is 
a vector field. Otherwise, the field 
is a matrix field with this many 
columns. 


FTC2;] Field datatype. Options: 1 (Boolean), 
2 (character), 3 Cinteger), 
4 (floating point). 


Available for custom requirements. Contain: 10. 
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7 FP 
8 RPS 
9 LV 
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DESCRIPTION 


Integer vector of file parameters. The 
meaning of the values are: 


FP(1] 


FPC2] 


FP(C3] 


FP[4] 


FPC5] 


FPCL6] 


FP[7] 


FPC8] 


FP[9] 


FPC10] 


FP[11] 


Field number of layer value field 
(origin 1) or 0 if file not layered. 


Tie number used when accessing file 


Number of fields, including latent 
fields. FP([3]=(C1lloFT). 


Number of components preceding the 
data components. FP[4]=210. 


Maximum number of records per set of 
components. A/FP[5]=2RPS. 


Number of sets, including sets with 
no active records. 
FPL6]=CeRPS)=CopARPS). 


Number of records, including records 
flagged for deletion. FP(71]=(+/RPS). 


Number of active (not latent) fields, 
including the deletion flag field if 
one exists. FP[8]=(€+/xFT[1:]} 


Number of active (some active records) 
sets. FPL9]=(€+/0#ARPS). 


Number of active (not flagged for 
deletion) records. FP([101]=(+/1ARPS). 


Magnitude is field number of deletion 
flag Boolean vector field Corigin 1) 
or 0 if no deletion flag. If nonzero: 
LA.=FTES IFPCIlId. If negative: FP[5] 
inactive records are added for each 
new set, i.e. FP(I5]A.=RPS. 


Integer vector with one element per set (FPI[6] 
elements) of the number of records, including 
those flagged for deletion, in each set. 


Layer values. LV is a vector with one element 
per set (FPI6] elements) if FT[£1;FP[1]]=1 or 

a matrix with one row per set if FT[1:FP[11]1]>1. 
LV is not used (empty numeric vector) if 


FPEL11=0. 
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COMP. 

NO. VARIABLE DESCRIPTION 

10 ARPS Integer vector with one element per set (FPI[6] 
elements) whose magnitude is the number of 
active (not flagged for deletion) records in 
each set. If positive, the active records in 
the corresponding set are the leading records 
(no interspersed deleted records). If 
negative, the active records are not 
exclusively the leading records. This 
component is not used (empty numeric vector) if 
FP£111]=0. 


11 to (latent) Available for custom requirements. Contain: 10. 
FPL 4] 


(F+FP[4]) DATA Array of data for field F in set S. The array 
+(FP[31x is an empty numeric vector (latent) if 
S-1) FTC1;FJ=0. It is a vector if FTC1;F]=1. It is 
an n-column matrix (n¢FT{[1;F]) if n#1. The 
where: array (if not latent) has RPS[S] elements or 
rows, of which IARPS[S] are active, i.e. not 
1<FSFP[3] flagged (by a corresponding 0 in field !FP(11]) 
1<S<FP[6] for deletion. The array will never have more 
than FP[5] elements or rows (i.e. records), and 
will always have exactly FP[5] elements or rows 
if FPfC111<0. 


Components 1 (DOC), 2 (FN) and 3 (FD) are not essential pieces of the 
file. However, since they serve to document the file, they are 
included and recommended. 


Components 4 (FT) and 7 (FP) completely define the structure of the 
file. They are constructed to satisfy the requirements of the 
particular application. After creating an empty file (via OFCREATE 
or OCREATE) and assigning values to FT and FP, you initialize the 
file by calling INITFILE with FP and FT as its arguments. 


Components 5 and 6 contain empty vectors and are not used. It is a 
good practice to leave a few "latent" components for future 
unanticipated requirements. 


Components 8 (RPS), 9 (LV) and 10 (ARPS) are used and updated as 
needed by the utility functions. 


Components 11 to FP[4] are additional latent components in case more 
than 2 (components 5 and 6) are needed for the particular application. 


All components beyond FP[4] are reserved for the data. 
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The implementation of the file utility functions for this specific 
file layout is left as an exercise in the problems at the end of the 
chapter. 


This file layout is illustrated and discussed further in the next 
section. 
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PROBLEM: Construct a file for maintaining a database of information 
about 45,000 insurance policyholders. For each 
policyholder, you will keep track of: policy number (12 
digits/letters), issue age, issue date, sex (M or F), 
classification (S,A,B,C,D) and face amount. Layer the file 
by classification. 


TOPIC: An Illustration of File Utilities 


The first step is to define the fields for this file. They are: 


Face amount 
Deletion bit 


1. Policy number 
2. Issue age 

3. Issue date 

4. Sex 

5. Classification 
6. 

ye 


The last field, deletion bit, is optional. You should consider how 
the file will be used before you decide whether or not to employ a 
deletion bit field. If employed, record deletion is quick because 
deleted records are not physically removed from file. They are 
simply flagged for deletion (bit=0). However, record searching and 
retrieval will be slower because "deleted" records must be detected 
and ignored. 


For this illustration, we will employ the deletion bit field. 


We have defined 7 fields but let’s allow room for another 3 fields 
for future expansion: 


8. Clatent) 


9. Clatent) 
10. Clatent) 
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The next step is to specify the nature of each field precisely hy 
constructing FT, which will be component 4 of the file. 


WIDTHe12 1131311100 0 
TYPE ¢€ 2332243131121 
FT¢WIDTH,C.5)/TYPE 


Since the policy numbers (field 1) may contain letters as well as 
digits, they will be stored in a 12 column character (type 2) matrix 
field. Issue age (field 2) and issue date (field 3) will be stored 
as integer (type 3) vector (width 1) fields. The dates will be 
stored in YYYYMMDD format. Sex (field 4) and classification (field 
5) will be stored as character vector fields. Face amount (field 6) 
will be stored as a floating point (type 4) vector field since the 
values may contain cents but will be stored in dollar units. 

Deletion bit (field 7) must be stored as a Boolean (type 1) vector 
field. Fields 8 to 10 are stored as latent (width 0) Boolean fields. 


Next, define the elements of FP, which will be component 7 of the 
file. The file will be layered by classification which is field 5: 


LAYER¢5 

Let's pick a tie number to which the file will be tied when used: 
TIE€¢987 

There are 7 active fields and 10 fields in all: 


AFLDS¢7 
INCR€10 


The file begins with 10 reserved components. Let's include another 
40 latent components (11 through 50) in case we later need a place to 
store related items. 


DISP¢50 


The file will contain 45,000 or so records. lLet’s keep our 
components down to some manageable size so WS FULL errors are kept to 
a minimum and so new records may be added without having to read 
giant objects. Let’s arbitrarily limit each data object to 2000 
records. This means the 45,000 record file will contain at least 23 
sets of data components. To read one field for all records will 
require at least 23 file read operations. By increasing the blocking 
factor, the number of sets decreases, and vice versa. For some 
applications, you may want a block size of 30,000. For others, a 
block size of 10 may be ideal. Here, we will use 2000. 


BLK€2000 
Since we are uSing a deletion bit field, we can choose either of two 


methods for adding new sets. After the 2000th record is added to the 
last set of the file and another record is to be added, a new set 
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must be appended to the file. This set mayh be appended with just 
the single new record, or with 2000 records, 1999 of which are 
flagged deleted. If the latter approach is taken, subsequent new 
records will simply replace "deleted" records. 


From your point of view, as the programmer using the file utility 
functions, the utility functions behave the same regardless of the 
approach chosen. Which approach you should use is a function of your 
APL file system implementation. Some systems behave poorly (i.e. 
gobble up much disk storage) when asked to replace an object ina 
component by a larger object. On such systems, you should choose the 
latter approach so that all the records of the set are added at 

once. Then, the components in that set will not grow. 


Here, we will choose the latter approach by specifying the number of 
the deletion field as a negative number: 


DFLD¢ 7 
Let’s construct FP: 
FPCLAYER,TIE,INCR,DISP,BLK,0O,0,AFLDS,0,0,DFLD 
We will name the file 'POLDATA’. Create the file and run INITFILE: 


’POLDATA’ OFCREATE TIE Cor OCREATE on SHARP APL) 
FP INITFILE FT 


At this point the file has 50 (i.e. DISP) components, no records and 
no sets. Before we start adding records, let's take a valuable 
moment to construct and replace the 3 documentation components: 


DOCe'’Insurance policyholder database’ 

FN¢7 5o0'PNUM IAGE IDATESEX CLASSFACE DBIT '! 
FD€(15T'Policy number'),[1]...,(0.51]15T'Deletion bit’ 

DOC OFREPLACE TIE,1 Cor OREPLACE on SHARP APL) 
FN OFREPLACE TIE,2 

FD OFREPLACE TIE,3 


From this point on, we can simply use the file utility functions. 
Let's catenate 4 records: 


Fl¢€C1l2t’ABCD’),£C1iIC12T'AL’),C11012T'XY¥Z99'),0.5112T' P55! 
(policy number) 


F2€25 55 45 35 (issue age) 
F3€¢€19820715 19850123 19851230 19860402 (issue date) 
F4¢'M' (sex, all male) 
F5¢'SASS' (classification) 
F6¢50000 30000 40000 25000 (face amount) 


FP¢FP CATRECWS 4 
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Which ones are 45 or older? 
'F2245' SELECT FP,1 0 2 
XYZ99 
Al 
Make the 45 year old a female: 
FP¢(CFP,2) IOTA 45) INDEXA FP,4 ASSIGN 'F! 


Increase each face amount by a factor of 100 for those with standard 
(S) classification: 


FP¢+(FP, ~6 6) EXECUTE 'F6¢100XF6’ LAYERS 'S’ 
Return the ages of those with standard (S) classification: 


»’’ SELECT FP,2 LAYERS 'S’ 
25 45 35 


Delete policyholder records with issue dates in 1985: 
FP¢'19854LF3+10000' COMPRESS FP,3 

When done uSing the file, untie it: 
OFUNTIE FP(2] Cor OUNTIE on SHARP APL) 


To use the file again later, retie the file and read the file 
parameters vector from component 7: 


'POLDATA’ OFTIE 987 Cor UTIE on SHARP APL) 
FP¢COFREAD 987 7 Cor OREAD on SHARP APL) 


You can then use the file utility functions again. 
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PROBLEMS : (Solutions on pages 383 to 448) 


1. Suppose you have developed an application and are encountering 
frequent WS FULL error messages. Perhaps there are too many 
functions in the workspace. Design and implement a set of file 
utility functions which may be used to store some of these 
functions on file and to retrieve them when needed. 
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2. Sketch a flowchart of the FILEDOC function described in this 
chapter. Compare it to the flow of the FILEDOC function listed 
in the solutions at the back of the book. 


3. In the Workspace Design and Documentation chapter, a simple 
application system is developed for maintaining a list of 
employees. Rewrite the EMPLOYEES function (the large one) to 
assume that employee information will be kept on file rather than 
in global workspace variables. Do not design a precise file 
organization. Rather, use the file utility functions introduced 
in this chapter. 


4. Try your hand at writing one or more of the file utility 
functions for the specific multi-set transposed file layout 
presented in this chapter. Compare your function to the listing 
of that function included in the solutions at the back of the 
book. The functions listed require an APL*PLUS system. If you 
use SHARP APL or APL2, see the next problem. 


5. What general modifications must be made to the APL*PLUS system 
file utility functions written in the previous problem so that 
they will work on a SHARP APL system? On an APL2 system? 


a 


Chapter 15 


BOOLEAN TECHNIQUES 


The treatment of logical conditions in APL is simple and 
powerful. The concept of "true" is represented by the numeric value 
1 and "false" by 0. These values may be manipulated with the same 
ease as those of any other numeric values. An array which contains 
only 1s and Os is called a "Boolean" array (after the mathematician 
George Boole) or a "bit" array (after the unit of computer storage). 


The APL language contains 7 primitive functions which return 
exclusively Boolean results (=,#,>,2,<,#,€). These are called 
relational functions. There are 5 primitive functions which not only 
return Boolean results, but also require exclusively Boolean 
arguments (~,A,V,“%,¥). These are called logical functions. 


On the surface, these 12 functions provide an adequate, but not 
wonderful, set of capabilities for working with logical data. 
However, by applying the APL operators (such as reduction and scan) 
to these functions and by utilizing the "shift and compare" 
techniques available in APL, you can begin to appreciate the rich 
Boolean functionality of APL. Extremely complex problems can be 
solved directly using noniterative Boolean techniques that would 
never even be considered in another programming language. 


This chapter presents no new or complex APL primitive functions. 
Rather it presents deeper interpretations of existing simple 
functions. The aim of the chapter is to develop your Boolean 
vocabulary and to broaden your thinking when faced with Boolean 
problems. 
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PROBLEM: The dyadic logical functions Aa, Vv, a, ¥ require Boolean 
arguments and return Boolean results. Since they are 
scalar functions, the result is a scalar when both 
arguments are scalars. For the set of four possible pairs 
of arguments (0 and 0; O and 1; 1 and O; 1 and 1), each 
function returns its four distinct Boolean results. For 
example, the results of the A (Cand) function may be 
expressed in the table: 


Right Argument 


en 
Argument 
2 ie 


Viewing this table of A results as a vector (0 0 0 1), you 
can see that the results for Vv, a, ¥ are respectively: 
O12131,131310, 1000. There are 16 possible 
combinations of 4 bits. These 4 functions produce only 4 
of them. What functions may be used to generate the rest? 
What logical interpretations can be given to each of these 
functions? 


TOPIC: Logical Scalar Functions 


For a left argument L¢0 0 11 and a right argument R¢0 1 01, the 16 
possible combinations of results may be generated by the following 
expressions: 
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Expression 

C(L¢0 0 1 1) 
Result (R¢eO 10 1) Interpretation Equivalent 
000 0 CoLJe0 Always false 
0001 LAR And (both; "multiplication" ) 
001 0 L>R Except Cunless; and not; LAC~R) 

"subtraction" ) 

0011 L Left argument 
0100 L<R Nor not C(~LJAR 
0101 R Right argument 
0110 L#R Toggle if Cexclusive or) 
0111 LYR Or ("addition") 
1000 LYR Nor (neither) ~CLVR) 
1001 L=R Toggle if not 
1010 ~R Not right argument 
10daqai1 L2R Or not LY(~R) 
110 0 ~L Not left argument 
1101 L<=R Nand not (~L)VR 
1110 LAR Nand Cnot both) ~(C LAR) 
be. de. od CoLJel Always true 


To illustrate the use of these logical expressions, let us solve some 
problems. 


A. 


Given a vector DEP of bank deposits, how many deposits are 
between (inclusive) 100 and 200? 


+/(DEP2100)ADEP<200 Cand) 
+/(DEP<100)¥DEP>200 (nor) 


How many are greater than 250, ignoring those which are exactly 
500? 


+/(DEP>250)>DEP=500 Cexcept, and not) 
+/ (DEP=500)<DEP>250 (nor not) 


(It is easy to see why the second expression is typically read 
"how many deposits where (DEP>250) except where (DEP=500)'"', 
rather than "how many deposits where (DEP=500) nor not where 
CDEP>250)'". ) 


How many are either 100 or greater than 250, ignoring those 
which are exactly 500? 


+/(DEP>250)#DEPE100 500 (toggle if) 
+/(DEPS250)=DEP€100 500 (toggle if not) 
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D. How many are smaller than 10 or larger than 1000? 


+/(DEP<10)VDEP>1000 Cor) 
+/(DEP210)ADEP<1000 Cnand) 
+/(DEP<10)2DEP<¢1000 Cor not) 
+/(DEP210)=DEP>1000 Cnand not) 
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PROBLEM: There are 10 scalar dyadic relational or logical functions 
C=, #, >, =, <, =, A, VY, A, ¥)J. All of these may be used 
with the reduction or scan operators to derive functions 
which can operate on Boolean arrays. Which of these 20 
derived functions have useful interpretations? What are 
the interpretations? 


TOPIC: Logical Reductions and Scans 


Obviously, the word "useful" is subjective. So let us be 
subjective. Only two of the reductions have useful interpretations. 
However, we will throw in a third reduction (+/) since its 
interpretation becomes "how many" rather than "add up" when its 
argument is Boolean. 


Expression Interpretation 
+/R How many 
A/R All 
V/R Any 


To illustrate these functions, let us solve some problems. 


A. Given a character vector CVEC, how many nonblank elements does 
it contain? 


+/CVEC#' ' Chow many) 
B. Are all of the elements nonblank? 


A/CVEC#! ' Call) 
~V/CVEC=' ' Cnot any; none) 
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Six of the scans have useful interpretations. 


Expression Interpretation 
A\R Leading 
V\R Not leading not (leading 0s) 
<\R First 
<\R Not first not (first 0) 
#\R Leading 1-poles to 1-maps 
=\R Leading O-poles to O-maps 


Let us solve some problems using these functions. 
A. How many leading blanks (blanks before the first nonblank) exist 
in the character vector CVEC? 
+/A\CVEC=!' ' Chow many leading) 
B. Delete the leading blanks from CVEC, returning the elements 
beyond the leading blanks. 


CV\CVEC#' ')/CVEC (leading Os) 


Cc. Is the first nonblank character in CVEC a digit? 
V/CC<\CVEC#! ')/CVEC)J€'0123456789' (first) 


(v/, i.e. any, is used in case there is no first nonblank) 


D. Delete the first ’',’' from CVEC, returning the rest of CVEC. 


C=\CVEC#!' ,')/CVEC (first 0) 


The remaining two scans (#\ and =\) require some explanation before 
using them to solve problems. A "maps" vector is a Boolean vector 
which consists of sets of contiguous 1s (1-maps) separated by one or 
more Os (O-maps). For example, the following bit vector contains 3 
1l-maps (each of which is underlined) and 4 0-maps: 


0O0121212120003212002312d2312d312d31200 


A "leading 1-poles" vector is a Boolean vector which consists of 
pairs of 1s, separated by zero or more Os. The 1s are called 
"poles". The left pole in each pair may be viewed as the starting 
element of a set of contiguous elements. The right pole in each pair 
may be viewed as the next element beyond the ending element of the 
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set. For example, the following bit vector contains 3 pairs of 
leading 1-poles. 


Notice that l-maps and leading 1-poles are alternate means of 
conveying the same information. Specifically, they each identify 
spans of contiguous elements. 1-maps do so by using 1s to flag the 
elements within the spans. Leading 1-poles do so by using 1s to flag 
the starts of spans and the starts of non-spans Chence the word 
"leading"). 


The #\ function converts bit vectors from the leading 1-pole 
representation to the 1-maps representation: 


co 


\Oe 


O¢7\0 0100001 0 
001210 0 0 1 0 
0O01i1a3t1 0 0 


The =\ function converts bit vectors from the leading O-poles 


representation (use Os as poles instead of 1s) to the O-maps 
representation (use OS as maps instead of 1s). 


a) He d (Nes aie ieee Res Jae 
1101 1 
110 0 0) 


ae) 
OOF 
OrRP 
HOP 
rr oO 
oom 
Or Pp 
OrO 
OrP- 
OrO 
FOP 


0 
1 
1 
Let us solve some problems. 


E. Given a character vector CVEC, return all characters within 
quotes. 
CA#\CVEC='''')/CVEC 
(This expression also returns the leading quote of each quote 


pair and the second quote from each pair of "doubled" quotes 
within quote pairs.) 


F. Return everything in CVEC except quote characters or characters 
within quotes. 


Te=\CVEC#'''! 
(TA~10T) /CVEC 


PN PN PA AG Pe Ps AN Pus Os AG AN Pus Ps BD Rae Na Ne Ba Aas A PN Os ON 


=23 09> 


Chapter 15 BOOLEAN TECHNIQUES 


PROBLEM: In the last expression of the final illustration in the 
prior section, a shift-and-compare operation (TA™10T) is 
performed to produce the effect of extending each O-map to 
the right by one element. What other shift-and-compare 
operations have useful interpretations when applied on 
Boolean arrays? What are the interpretations? 


TOPIC: Logical Shift-and-Compare (Map) Operations 


In the following list of shift-and-compare operations, the catenate 
and drop technique (e.g. ~110,B) is used instead of the rotate 
technique (e.g. ~10B). The reason for this choice is that the rotate 
technique has the undesirable effect of "filling" the first element 
Cor last element if 10B) with the arbitrary value of the last (or 
first) element of the array, rather than with the 1 or 0O needed to 
make the comparison work in every case. 


Also notice that the catenate is done before the drop (e.g. ~110,B) 
instead of the other way around (e.g. 0,°7114B) so that the expression 
will behave correctly when the argument is empty. 


Expression Interpretation 
R#-110,R 1-maps to leading 1-poles 
R="111,R O-maps to leading 0O-poles 
R>~110,R 1l-maps to first 1 bits 
R27111,R O-maps to first 0 bits 
RV~110,R extend 1-maps to right by 1 
(shorten O-maps from left by 1) 
RA~111,R extend O-maps to right by 1 


(shorten l1-maps from left by 1) 
Since each shift-and-compare operation transforms a map, these 
operations are sometimes called "map" operations. 
Let us solve some problems using these operations. 
A. Given a character vector SENTENCE, return the lengths of the 
words in it. A word is any set of contiguous letters. 


LETTERS¢ ’ ABCDEFGHIJ KLMNOPORSTUVWXYZabcdef...xyz’ 


MAPS¢ (SENTENCE€LETTERS ) , 0 CO to insure last element 
not a letter) 

POLES¢MAPS#~110,MAPS (1l-maps to leading 1-poles) 

INDS¢€POLES/tePOLES Cindices of poles) 

INDS¢( CCoINDS)+2),2)oINDS (reshape to 2 column matrix) 

~/OINDS (subtract in pairs) 
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B. Determine the indices of the first letter of each word. 


MAPS¢SENTENCE€ LETTERS 
(MAPS>~110,MAPS)/tLeMAPS (l-maps to first 1 bits) 


Cc. Given a numeric vector BAL, how many times do the values go from 
positive to negative or vice-versa? 
MAPS¢<BAL<O 
+/1J/JMAPS#~110,MAPS (l-maps to leading 1-poles) 


D. Delete the extraneous (leading, trailing or redundant) blanks 
from a character vector CVEC. 


NB¢CVEC#F’ ' 
CVEC€ (NBV~110,NB)/CVEC Cextend 1-maps to right by 1) 
CVECe(-' '= "1 TCVEC)ICVEC (drop last element if blank) 
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PROBLEM: Some applications, such as text processing, are well-suited 
to the Boolean techniques discussed in the prior sections. 
These techniques allow you to analyze an array without 
iterating by character. For example, (+/A\R=' ') tells you 
how many leading blanks are in the array R. At times, you 
will want to work with an array which is the catenation of 
several arrays (e.g. the sentences in a paragraph). Then 
you may want to apply the Boolean operations on each 
respective "subarray" (e.g. the number of leading blanks in 
each sentence of a paragraph). Design and implement a set 
of Boolean utility functions which will perform the 
operations described in the prior sections, for each of the 
subarrays in a specified array. 


TOPIC: Logical Partition Operations 


We will use the term "partition" to refer to each subarray of an 
array. For example, we may view the following character vector as 
consisting of 3 partitions (sentences): 

‘HELLO. THIS IS A SAMPLE. WHAT DO YOU THINK?!’ 


There are a number of different methods which may be used to define 
where in this character vector each of the partitions begins and 
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ends. For example, we could specify the indices of the starting 
elements and of the ending elements for each partition; or we could 
specify the starting indices and the lengths of each partition. 


Since we will be using Boolean techniques discussed above, we will 
define the locations of the partitions by specifying a Boolean vector 
which has as many elements as the character vector and which has is 
in the indices which correspond to the first element of each 
partition. Such a Boolean vector will be called a "partition vector". 


A partition vector for the character vector above is defined below 
(with spaces removed from the display of the partition vector for 
clarity): 


CVEC¢’'HELLO. THIS IS A SAMPLE. WHAT DO YOU THINK?’ 
PV € 100000100000000000000000010000000000000000000 


We desire a set of functions which will perform each of the Boolean 
operations described in prior sections, but will do so independently 
on each of the partitions of a specified array. Since the functions 
require the knowledge of how the array is partitioned, the partition 
vector must be an argument to each function. For example, if +/ and 
A\ were defined to permit a partition vector left argument, we could 
determine the number of leading blanks in each sentence of CVEC with 
the following expression: 


PV+/PVA\CVEC=' ' 
The result would contain one element per partition (e.g. 0 2 2). 
Since +/ and A\ do not, in fact, accept partition vector left 


arguments, we will design our own set of "partition functions". 


Non-Partition Partition (PV) 


Expression Expression 
+/R PV pPLUSRED R 
A/R PV pANDRED R 
v/R PV pORRED R 
A\R PV pANDSCAN R 
V\R PV pORSCAN R 
<\R PV pLTSCAN R 
<\R PV pLESCAN R 
#\R PV pNESCAN R 
=\R PV pEQSCAN R 

R#7110,R PV pNEMAP R 
R="111,R PV pEQMAP R 
R>7~110,R PV pGTMAP R 
R2=7111,R PV pGEMAP R 
RV~110,R PV pORMAP R 
RA7~111,R PV pANDMAP R 
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(Notice that the logical scalar functions (e.g. L>R) are not included 
here since the scalar functions work correctly whether or not their 
arguments are partitioned. ) 


The number of leading blanks in each sentence in CVEC is: 
PV pPLUSRED PV pANDSCAN CVEC=' ' 


The use of these functions will be further illustrated in the next 
section. 


The definitions of these functions make heavy use of the Boolean 
techniques described in prior sections. You may want to study the 
definitions of some of the functions to become better acquainted with 
actual applications of Boolean techniques. 


(The algorithms underlying many of these functions were conceived by 
Robert A. Smith of STSC and are introduced in the publication, 
Boolean Functions and Techniques, 1975, Scientific Time Sharing 
Corporation. ) 


C[WSID: BOOLEAN ] 
V R¢P pPLUSRED B;T 
Returns +/S for each partition §S of B, 
where P is the corresponding Boolean 
partition vector whose 1s mark the first 
element of each partition. 


Works on Boolean B only. For numeric B: 
R¢C1O0P)/+\B 
R¢R- 110,R 


co 
> 
= 
DDDDIDD ND 


rm 
\O 
a) 
D 


[10] A Compress partition vec for just 1 bits and 
[11] A leading bits: 
[121] TepRe(PVB)/P 
[13] A Convert to indices: 
C14] R¢R/uT 
[15] aA Lengths of compressed partitions: 
[16] R¢eC1JR,OI0O+T)-R 
[17] A Deduct 1 for partitions with leading 0 bit: 
[18] R¢R-~P/B 
V 


aa A SS 
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[WSID: BOOLEAN ] 
V R€P pANDRED B 
Returns A/S for each partition S of B, 
where P is the corresponding Boolean 
partition vector whose 1s mark the first 
element of each partition. 
Compress partition vec for just O bits and 
leading bits: 
C7] R¢(P2B)/P 
[8] a Which partitions have no Os beyond leading bit? 
C9] R¢€R/10R 
[10] A ...and have a leading 1 bit: 
C11} ReRAP/B 

V 


mo 
W 
hel 
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V R€P pORRED B 
Returns v/S for each partition S of B, 
where P is the corresponding Boolean 
partition vector whose 1s mark the first 
element of each partition. 
[5] Compress partition vec for just 1 bits and 
[6] leading bits: 
C7] R¢(PVB)/P 
[8] aA Which partitions have no 1s beyond leading bit? 
[9] R¢R/10R 
[10] A Leading 1 bit or any trailing ls: 
C11] R¢R<=P/B 

V 


C1] 
C2] 
C3] 
[4] 
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V R€P pANDSCAN B;T 
Returns A\S for each partition S of B, 
where P is the corresponding Boolean 
partition vector whose 1s mark the first 
element of each partition. Uses fact 
[5] that A\A «> ~v\~A. 
[6] Consider just 0 bits and leading bits: 
(7 ] T¢P2B 
[81 aA All 1S except leading 1 bits (Cas Os): 
C9] R¢~T/B 
[10] A 1-maps to l-poles and expand: 
C11] R¢T\R#°110,R 
[12] A 1-poles to l-maps and toggle: 
[13] Re~#\R 

Vv 


C1] 
C2] 
C3] 
C4] 


DDDOIODD 
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[WSID: BOOLEAN ] 

VY R¢P pORSCAN B;T 
Returns vV\S for each partition S of B, 
where P is the corresponding Boolean 
partition vector whose 1s mark the first 
element of each partition. 
Consider just 1 bits and leading bits: 
C6] T<+PVB 
[7] aA All 1s except leading O bits: 
C8] R¢T/B 
C9] A 1-maps to 1-poles and expand: 
[10] R¢T\R#°110,R 
[11] A 1l-poles to 1-maps: 
[12] R¢#\R 

V 


m= 
uw 
wi 
DDDDD 
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V R€P pLTSCAN B:T 
[1] aA Returns <\S for each partition S of B, 
[2] A where P is the corresponding Boolean 
[3] A partition vector whose 1s mark the first 
C4] a element of each partition. 
[5] a Consider just 1 bits and leading bits: 
[6] T¢PVB 
[7] aA All 1S except leading 0 bits: 
C8] R¢eT/B 
[9] A i-maps to leading 1 bits and expand: 
[10] R¢T\R>°110,R 
C11] A Set leading 1 bits to 1: 
C12] R€RVPAB 

V 
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V R€P pLESCAN B;T 
Returns <=\S for each partition S of B, 
where P is the corresponding Boolean 
partition vector whose 1s mark the first 
[4] element of each partition. 
[5] Consider just 0 bits and leading bits: 
C6] T¢P2B 
C7] aA All 1s except leading 1 bits (as Os): 
C8] R¢e~T/B 
C9] A 1-maps to leading 1 bits and expand: 
[10] R¢T\R>'110,R 
[11] A Set leading 0 bits to 0; subtract other 
[12] aA leading 1 bits: 
[13] R¢R<B2P 

V 


C1] 
[2] 
C3] 


ODDD D 
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[WSID: BOOLEAN] 
V R¢P pNESCAN B 
Returns #\S for each partition S of B, 
where P is the corresponding Boolean 
partition vector whose 1s mark the first 
element of each partition. 
1-poles to 1l-maps, shift right, mark overlap 
leading bits: 
C7] R¢P/-110,7\B 
[8] aA 1-maps to l1-poles, expand, marking leading 
C9] aA bits to toggle: 
[10] ReP\R#4°110,R 
[11] A Toggle selected leading bits, l1-poles to 1-maps: 
[12] R¢#\B#R 

V 


eq 
> 
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V R¢P pEQSCAN B 
Returns =\S for each partition S of B, 
where P is the corresponding Boolean 
partition vector whose is mark the first 
element of each partition. 
O-poles to O-maps, shift right, mark overlap 
leading bits, toggle: 
C7] R¢e~P/~111,=\B 
[8] A 1-maps to 1-poles, expand, marking leading 
[9] A bits to toggle: 
[10] R€eP\R#°-110,R 
C11] a Toggle selected leading bits, O-poles to 0-maps: 
[12] R¢=\BA4R 

V 


on 
W 
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V R¢P pNEMAP B 
Returns S#°110,S for each partition S of B, 
where P is the corresponding Boolean 
partition vector whose 1s mark the first 
[4] element of each partition. 
[5] 1l-maps to leading 1-poles: 
C6] R¢Bf 110,B 
[7] aA Toggle leading bits which are not correct: 
C8] R¢R#PAB#ZR 

V 


C1] 
[2] 
[3] 


DDDDDND 
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[CWSID: BOOLEAN] 
VY R€P pEQMAP B 
Returns S="111,S for each partition S of B, 
where P is the corresponding Boolean 
partition vector whose 1s mark the first 
[4] a element of each partition. 
[5] aA O-maps to leading 0-poles: 
[6] R¢B='"1/1,B 
[7] a Toggle leading bits which are not correct: 
C8] R¢R#PABFER 

Vv 


rn 
NO 
us 
DD D 
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V REP pGTMAP B 
[1] a Returns S>~110,S for each partition S of B, 
[2] a where P is the corresponding Boolean 
C3] aA partition vector whose 1s mark the first 
C4] aA element of each partition. 
[5] aA 1-maps to first 1 bits: 
C6] R¢B>~110,B 
[71 aA Toggle leading bits which are not correct: 
[8] R¢R#PABFR 

Vv 


C[WSID: BOOLEAN] 

V RP pGEMAP B 
[1] a Returns S2°111,S for each partition S of B, 
[2] a where P is the corresponding Boolean 
[3] aA partition vector whose 1s mark the first 
[4] aA element of each partition. 
[5] a O-maps to first 0O bits: 
C6] R¢B2°71/1.,B 
[7] a Toggle leading bits which are not correct: 
C8] R¢R#PABZR 

V 


CWSID: BOOLEAN ] 

V R¢P pORMAP B 
[1] aA Returns SV°110,S for each partition S of B, 
[2] a where P is the corresponding Boolean 
[3] aA partition vector whose 1s mark the first 
[4] aA element of each partition. 
[5] aA Extend 1-maps to right by 1: 
[6] R¢BvYV 110,B 
[7] aA Toggle leading bits which are not correct: 
[8] R¢R#PABAR 

Vv 
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[WSID: BOOLEAN ] 

V R¢P pANDMAP B 
[1] aA Returns SA°-111,S for each partition S of B, 
[2] a where P is the corresponding Boolean 
[3] aA partition vector whose 1s mark the first 
[4] a element of each partition. 
[5] aA Extend O-maps to right by 1: 
C6] R¢BA~11/1,B 
[7] aA Toggle leading bits which are not correct: 
[8] R¢R#PAB#FR 

V 
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PROBLEM: Write a monadic function RELABEL which will locate and 
modify all of the line labels (and references to those 
labels) in any specified function (whose character vector 
visual representation is provided as the argument to 
RELABEL) such that the labels become Ll, L2, L3,... The 
result of RELABEL is the modified visual representation. 


TOPIC: An Illustration of Boolean Techniques 


The visual representation of a function is a character vector which, 
when displayed, looks just like the display produced by the command 
VFNNAME[O]V. In order to be a character vector, not matrix, which 
displays on several lines, the visual representation necessarily 
contains "newline" (carriage return) characters. For example, 
suppose VR is the visual representation of the simple AVG function: 


eVR 
36 
VR 
V ReAVG V 
C1j ReC€+/VI+eV 


Vv 


To get a more revealing view of VR and why it has the shape displayed 
above, we can replace all newline characters by the "@" symbol, and 
all blank characters by the "_'' symbol. Suppose NL is a scalar 
newline character. 
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T¢VR 
TLCT=NL)/teT]<'@! 
TLCT=' ')/toTIe'_! 
T 


V_R¢EAVG_V@[1]__R€(€+/V)+eV@____V@ 


The visual representation of a function may be generated by using the 
OVR system function in APL*PLUS, or 1 OFD in SHARP APL, or the CRAVR 
function (and OCR) developed in the Workspace Design and 
Documentation chapter. The newline character may be generated by 
using OTCNL in APL*PLUS, OTC[2] Corigin 1) in APL2 or by selecting 
the newline character from OAV (the index of the element depends upon 
the APL implementation; it is 157 in origin 1 for SHARP APL). 


Let us define the header of the desired RELABEL function: 
V NEWVR¢RELABEL VR 


We will find it convenient to view the visual representation as a 
partitioned array, where each line of the function is a partition. 

We will construct a partition vector whose 1s flag the first 
character on each line. Since every line is preceded by a newline, 
except the header line, and since the last character in the visual 
representation is a newline, we may construct the partition vector as 
follows: 


C1} NL¢ 1¢VR=OTCNL 
(Use OTC[2] in APL2 or OAV[157] in SHARP APL instead of OTCNL.) 


To solve this task, we must locate all identifiers, including labels, 
referred to in the visual representation. Identifiers are 
consecutive strings of alphanumeric characters whose first letter is 
alphabetic. However, we must be careful to ignore all such strings 
located within comments or within quotes, since such strings are 
probably words and not identifiers. Further, we cannot simply ignore 
everything beyond the first comment symbol (A) on each line. The 
first comment symbol may be located within quotes and so will not 
actually represent the beginning of a comment. (Note: many APL 
implementations now permit end-of-line comments, in addition to 
full-line comments, and so we cannot assume that the comment symbol 
will appear in a specific position on the line.) 


Our first step is to find and ignore all characters within quotes 
Ceven quotes within comments). Then, the comment symbols not within 
quotes begin genuine comments, so we can use them to find and ignore 
all characters within comments. Finally, by ignoring all characters 
either within quotes or within comments, we can proceed with our job 
of finding identifiers. 


As we Saw in a prior section, =\ can be used to ignore characters 
within quotes, using an expression like: 


C=\CVEC#'''')/CVEC 
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However, this expression cannot be blindly applied to VR since a 
comment may contain an odd quote character (e.g. A YOU DON'T SAY!) 
which will cause mismatching of quote characters. Instead, we must 
perform the =\ (0-poles to O-maps) on each partition (function line) 
of VR individually. 


[2] NQevR#''!! 
[31 NCCON¢NL pEQSCAN NO 


NQ flags the non-quote characters. NCCON is a map vector of the 
characters which are not within quote pairs (i.e. character 
constants) within each function line. Note that the closing quotes 
of each quote pair are included in the map while the opening quote is 
not. 


We can now use NCCON to help us locate all valid comment symbols. 
[4] NC€¢NCCONAVR='A' 


NC flags the non-comment characters (0s flag the valid comment 
symbols). For each line of the visual representation, we wish to 
propogate the 0 which flags the comment symbol in NC so that all 
following characters are flagged with Os (to subsequently ignore 
them). For a single partition, we can use A\. For all partitions, 
we must use pANDSCAN: 


[5] NCMT¢+NL pANDSCAN NC 


NCMT is a map vector of the characters which do not follow a valid 
comment symbol. Note that the comment symbols themselves are not 
included in the map. 


It is now a simple matter to construct a map vector, PARSE, of the 
characters which are not within quote pairs and which do not follow a 
comment symbol within each function line: 


[6] PARSE¢NCMTANCCON 
Our next thrust is to look at the characters flagged by PARSE and to 
construct a pole vector whose poles (pairs of 1s) flag the 
identifiers. First, flag the digit characters, the letters, the 
alphanumeric characters and blanks: 

C7] NUM¢PARSEAVRE!0123456789' 

[8] ALP©PARSEAVRE'ABCD...XYZAabcd...xyza’ 

[9] AN¢NUMVALP 

[10] BL€«PARSEAVR=!' ' 
Then, construct a pole vector of the alphanumeric maps. 

[11] PAN¢AN#~110,AN 


Some of these pole pairs flag identifiers (e.g. COUNT or AMT85 or J) 
while others do not (e.g. 58 or 1E6). We must "turn off" the pole 
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pairs whose first pole does not correspond to an alphabetic 
character. The technique used to do this follows: 


C12] T¢PAN/ALP 
[13] PID«PAN\TvV 196T 


Notice that the compression (/) places the pole values next to each 
other, the map operation extends the value of the first pole (1 or 0) 
into the second pole, and the expansion (\) replaces the poles (some 
Os now) into their original positions. PID is a pole vector which 
flags identifiers. 


Since some identifiers begin with the '0O’ symbol Ce.g. OIO or OPP or 
OEX), we must adjust the poles in PID to include the ‘0D’. 


C14] T¢1OPID 
[15] TeT\'O'=T/VR 
[16] PID¢TVPID> 16T 


Now that we have located all identifiers within the visual 
representation, we must locate the labels. Labels are identifiers 
which immediately follow the bracketed function line number Cignoring 
spaces) and which immediately precede a colon (:). 


Let us find the line numbers. Construct a pole vector of the numeric 
digit maps. 


[17] PNUM¢NUM# -110,NUM 


If the first pole in the PNUM pole pairs is exactly 2 characters 
after a newline character, that pair of poles identifies a line 
number. Flag the character following the "J" character after the 
line number at the beginning of each line. 


C18] START< 1®PNUM\ ~10PNUM/ “190NL 
Notice that / and \ are again used to place the pole values next to 
each other. In this case, the rotate operation changes 1 0 poles to 
O 1 poles so that the last pole of the numeric pole vector 
(corresponding to the "'J") is flagged. 
The next step is to move the bits in START to the right so that they 
correspond to the first nonblank after the bracketed line number. 
Construct a pole vector of the blank maps. 

[19] PBL©BL¥ 110,BL 
Flag the first nonblank character in each line: 


[20] START¢C(START>BL)VPBL\ ~1®PBL/START 


The (START>BL) term is included for lines which do not have any 
blanks between the bracketed line number and the next nonblank 
Character. 
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By matching up START and PID, we can construct a pole vector of 
identifiers which begin with the first nonblank character of a 
function line: 


[21] T¢PID/START 
[22] PSID¢PID\TV 16T 


Construct a pole vector of labels (i.e. identifiers at the beginnings 
of function lines which are followed by a colon): 


[23] Te’:’'=PSID/VR 
[24] PLAB¢CPSID\TV19OT 


Using the pole vector, determine the starting and ending (plus 1) 
indices of the identifiers in the function (as a 2 column matrix, one 
row per identifier): 


[25] IND¢PID/tePID 
[26] NID¢CeIND)+2 
[27] IND¢(NID,2)oIND 


Determine the length of each identifier name: 


C28] IDSTART¢INDE ;010] 
[29] IDLEN¢INDE 31+0I0]-IDSTART 


From here on, Boolean techniques are not required. The techniques 
used are discussed in other chapters, most notably the Positioning 
Character Data chapter and the Sorting and Searching chapter. 


The RELABEL function is presented below in its entirety. The 
function uses the Boolean techniques described above. However, it 
has been modified in a number of ways. The logical partition 
functions (e.g. pANDSCAN) have been replaced by the equivalent logic 
so that RELABEL does not require their presence. The variables have 
been localized. RELABEL also has a left argument: a character 
matrix or vector (blank-delimited) of the names of the labels which 
are not to be renamed. Provide an empty character vector if all 
labels are to be renamed. For example, to relabel the function MODEL: 


ODEF '’ RELABEL OVR ‘MODEL’ Con APL*PLUS) 
3 OFD '’ RELABEL 1 OFD '/MODEL’ Con SHARP APL) 
OFX VRACR '' RELABEL CRAVR OCR 'MODEL’ Cotherwise) 
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C9] 

£10] 
[11] 
[12] 
C13] 
C14] 
[15] 
[16] 
C17] 
[18] 
C19] 
[20] 
C21] 
[22] 
C23] 
[24] 
[25] 
[26] 
[27] 
[28] 
[29] 
[30] 
C31] 


C32] 
C33] 
[34] 
[35] 
C36] 
C37] 
[38] 
C39] 
[40] 
C41] 
[42] 
[43] 
[44] 
[45] 
[46] 
[47] 
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CWSID: FNIDS] 

V R¢KEEP RELABEL VR;ALP3;AN;BL;COLS 3; FOUND; IDLEN;IDS; 
IDSTART ; IND; KLABS ; KLEN ; KSTART; LABS ;NB;NC3;NCCON ;NCMT; 
NEW;NID;NKEEP;NL;NLAB;NLEN;NQ;NSTART;NUM; PAN; PARSE; PBL 
;PID;PLAB;PNUM;PSID;S;START;T;01I0 

Modifies the vector representation (VR) of a 
function such that its labels and references to 
Same are changed to Ll, L2, L3,... for all labels 
but those specified in KEEP, a character matrix 

or vector (blank delimited) of label names. 

Requires subfunction: CMIOTA. 

O1IO0<0 
a Flag newline chars (Boolean partition vector): 
NL€~19VR=OTCNL A APL*PLUS 
A NL¢€ 1O9VR=-OTCLC1I] A APL2 
A NL¢€ 1¢VR=UDAVL156] A SHARP APL 
A Flag nonquotes: 

NO¢VR#’ pre 
A Map of chars not in quote pairs (i.e. char constants) 
a within each fn line (NCCON¢NL pEQSCAN NQ): 
NCCON¢€=\NQANL\T# 110, TE~NL/=\°111,NQ 

NQ<0 

Aa Flag non-a chars (includes as in quotes): 
NC©NCCONAVR='A' 

A Map of chars which do not follow a a Cignoring as 

A within quotes) within each fn line. as are flagged 0. 
A CNCMT¢<NL pANDSCAN NC): 

SENLZ=NC 

NCMT¢~#4\S\T#°110,T€~S/NC 

S¢TENC¢O 

A Map of chars which are not included within as or '’: 
PARSE¢NCMTANCCON 

NCCON¢NCMT€0 

A Flag digits, letters, blanks: 

NUM¢PARSEAVR€E! 0123456789 ' 

ALP*«PARSEAVRE ' ABCDEFGHIJ KLMNOPORSTUVWXYZAabcdefghijkim 
nopgqrstuvwxyzZaA’' 

BL€PARSEAVR=!' ' 

PARSE¢0 

A Flag alphanumeric chars: 

AN¢+NOMVALP 
A Pole vec of contiguous digits: 

PNUM¢NUM# ~-110,NUM 

NUM¢0O 
A Pole vec of contiguous digits/letters: 

PAN¢AN# 110,AN 

AN¢<0O 
A Pole vec of identifiers: 


DDDDDD 


PID¢PAN\TV 10T¢PAN/ALP 

ALP©PAN¢€0O 

a Flag 'O’ before identifiers (Onames): 
T¢1OPID 


T¢T\'O'=T/VR 
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V RELABEL (continued) 
[48] a Shift leading poles of Onames to include QO: 
£49] PID¢éTVPID>° 16T 
C50] T€¢0O 
[51] a Flag char following ] after line no.: 
[52] START¢€~10PNUM\ “10PNUM/ “10NL 
[53] NL€PNUM¢O 
[54] a Pole vec of contiguous blanks: 
C55] PBL€BL4 110,BL 
[56] A Flag 1st nonblank char in each line: 
(57] START¢CSTART>BL)VPBL\ 10PBL/START 
[58] BL€PBL¢O 
[59] A Pole vec of identifiers at start of line: 
C60j] PSID¢PID\TY 16¢T¢PID/START 
[61] START<¢O 
[62] A Pole vec of labels: 
[63] PLAB¢PSID\TV10T¢’:’'=PSID/VR 
[64] PSID<O 
[65] A Start and end (+1) indices of identifiers: 
[66] IND¢PID/tpPID 
[67] A No. of identifiers: 
[68] NID«CeIND)+2 
[691 IND¢(NID,2)pIND 
[70] aA Start indices of identifiers: 
C71] IDSTART¢IND[ 30] 
[721 A Lengths of identifiers: 
[73] IDLEN¢IND[E 31]-IDSTART 
C74] IND€O 
[75] A Map vec of nonblanks in labels to keep: 
[76] NBe'’' '#KEEP¢€,’ ' , KEEP 
[771 A Start indices in KEEP of identifiers: 
[78] NKEEP¢eKSTART¢(NB> 10NB)/tLeNB 
[79] @ Lengths of KEEP identifiers: 
C80] KLEN€(1+(NB>10NB)/tLeNB)-KSTART 
C81] A Length of longest identifier: 
[82] COLS¢€Cl/KLEN) IIT /IDLEN 
(83] A Raveled blank matrix of identifier names: 
[84] IDS€C(NIDXxXCOLS)o’ ' 
[85] a Fill them in (T¢+MONIOTA IDLEN): 
C86] T¢T+LeTEIDLEN/-~110,+\IDLEN 
C87] IDSCT+IDLEN/COLSXtNIDIJI©<VRE[T+IDLEN/IDSTART] 
[88] A Reshape to mat of identifiers: 
[89] IDS¢C(NID,COLS) eIDS 
[90] A Mat of label names: 
[91] LABS¢CCCNID,2)ePID/PLAB)[;0])4IDS 
C92} PID¢PLABEO 
[93] A Raveled blank matrix of labels to keep: 
C94] KLABS<¢(NKEEPXCOLS)o' ' 
[95] A Fill in (T¢MONIOTA KLEN): 
[96] T¢T+LeT¢KLEN/-~110,+\KLEN 
[97] KLABS(CT+KLEN/COLSXLNKEEP]©KEEP([T+KLEN/KSTART] 
C98] A Reshape to mat of labels to keep: 
[99] KLABS¢(NKEEP, COLS ) eKLABS 
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Vv 
C100] 
C101] 
[102] 
C103] 
[104] 
[105] 
[106] 
[107] 
C108] 
C109] 
C110] 
C211] 
[112] 
[113] 
[114] 
[115] 
C116] 
C117] 
C118] 
£119] 
[120] 
C121] 
C122] 
[123] 
[124] 
L125) 
C126] 
C127] 
[128] 
[129] 
C130] 
[131] 
C132] 
[133] 
[134] 
[135] 
C136] 
C1374] 
[138] 
[139] 

V 
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RELABEL (continued) 

Aa Squeeze out labels in KEEP: 
LABS¢(NKEEP=KLABS CMIOTA LABS) #/LABS 
A Row indices in LABS where rows of IDS are found: 
IND¢LABS CMIOTA IDS 

A No. of labels: 

NLAB¢ lee LABS 

A Flag identifiers which are labels: 
FOUND¢CIND<NLAB 

a Start indices of label identifiers: 
IDSTART¢FOUND/ IDSTART 

A Lengths of same: 


IDLEN€FOUND/ IDLEN 

A Indices into LABS of same: 
IND¢FOUND/IND 

A Building new labels: 'LIL2L3...°': 
NEWe’ °',61+ tLNLAB 

a Start indices in NEW of new labels: 
NSTART¢(NEW=' ')/ UL oNEW 


NEWCNSTART]¢’ L’ 

A Lengths of new labels: 

NLEN¢€ C11 NSTART , oNEW) -NSTART 

A Lengths replicated for all label identifiers: 
NLEN¢NLENCIND] 

A Start indices replicated as well: 
NSTART¢CNSTARTLIND] 

A Initialize replication vec: 

R¢(oVR)e1 

A Use Os to squeeze out old identifiers 
A CT¢MONIOTA IDLEN): 
T¢T+LeT¢IDLEN/-~11l0,+\IDLEN 
RCT+IDLEN/IDSTARTI€<0 

A Insert lengths to expand for new identifiers: 
RC IDSTART ]¢NLEN 

A Squeeze/expand vis rep as needed: 
R¢R/VR 

A Adjust for new lengths: 
IDSTART¢€IDSTART++\~110,NLEN-IDLEN 

A Insert new labels (T¢MONIOTA NLEN): 
T¢T+L0OT¢NLEN/-~110,+\NLEN 

RL T+NLEN/IDSTARTJI¢NEW(CT+NLEN/NSTART] 
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PROBLEMS : (Solutions on pages 449 to 458) 


1. What expression will tell you whether all of the elements of the 


numeric vector NVEC are integers? 


Delete the trailing blanks from the character vector CVEC. 


Left justify the rows of the character matrix NAMES (i.e. shift 
each row left until its first character is a nonblank). 


How many numbers (set of contiguous digit characters) are there 
in the character vector INPUT? 


Write the function TIMEADEFINE as described in the Computer 
Efficiency Considerations chapter. 


Write one or more of the functions IDENTIFY, LOCALIZE and 
UNCOMMENT as described in the Workspace Design and Documentation 
chapter. 
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IRREGULAR ARRAYS 


APL derives much of its power from its conciseness and 
consistency. Unfortunately, the real world is not nearly so concise 
and consistent. While APL sees the world as a set of rectangular 
arrays of data, the world is nonrectangular by nature. 


In this chapter, we deal with irregular (nonrectangular) arrays. We 
will present a typical problem which involves irreguiar arrays and 
will attempt to perform a variety of tasks on these arrays. We will 
examine alternative methods available in APL for performing these 
tasks on the irregular arrays. 


As an illustration of irregular arrays, suppose you wish to keep 
track of customer information for a business you operate. The 
following table shows some of the information. 


CUSTOMER INFORMATION 


UNPAID BILLS 


ID TERR 0 seeeaenna--5--------~----- 

NO. NO. NAME DATE INV. NO. AMOUNT 

3 5 ACME CORPORATION 3/15/86 372 586.25 

4/10/86 395 406.15 

4/25/86 407 100.00 

65 3 FASTENERS INC. 4/20/86 405 802.16 
74 5 KLINGLEY & SONS, INC. 

89 2 GHR CORP. 2/12/86 350 5.25 

4/10/86 396 59.60 


How would you store this information in APL variables? 


The ID numbers can be assigned as a vector with one element per 
customer: 


ID€3 65 74 89... 
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Likewise for territory numbers: 
TERRES. 33° 3: Zacks 


The customer names may be stored as a character matrix with one row 
per customer and as many columns as the widest customer name (or an 
arbitrary maximum number of columns). Assuming 500 customers and a 
width of 25 character columns, 


NAME€500 250(¥25T'ACME CORPORATION’) ,(C25T’ FASTENERS INC. ')... 


While this approach to storing the customer names in a matrix is 
tolerable, it has some disadvantages. First, if a customer's name is 
longer than the allowable width Chere 25), it must be abbreviated or 
truncated. Second, short customer names must be padded with blanks 
to the maximum width, implying a waste of storage. Third, when 
extracting a customer’s name from this character matrix, as for a 
form letter, the trailing blanks may need to be deleted, requiring 
more complex programming and more processing time. 


If these disadvantages are not major, you will do well to store the 
names in a character matrix and tolerate the disadvantages. If the 
disadvantages are great enough to be unworkable, you must use some 

other method to work with this irregular information. 


Conceptually, the customer names define an irregular array, a 
"vector" of character vectors. Each "element" of the "vector" is 
itself the character vector name for a single customer. In this 
chapter, we will refer to this type of array as a "nest of character 
vectors" or simply a "character nest". We will refer to the 
character vector "elements" as "items". 


Likewise, the unpaid bills information cannot be fit neatly into a 
conventional APL array. If we assign the values to a 3 column 
numeric matrix, there will not be one row per customer (as there are 
in a character matrix of customer names) and so we must maintain 
additional information which tells us which rows belong to which 
customer. 


Conceptually, the unpaid bills information defines an irregular 
array, a "vector" of 3 column numeric matrices. Each "element" of 
the "vector" is itself the 3 column numeric matrix for a single 
customer. In this chapter we will refer to this type of array as a 
"nest of numeric matrices" or simply a "matrix nest'"'t. We will refer 
to the numeric matrix "elements" as "items". 


In this chapter, we will present 4 different approaches to the 
problem of working with irregular arrays: 


APL2 nested arrays 
APL*PLUS nested arrays 
SHARP APL nested arrays 
Conventional APL arrays 


mm WN 
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You will notice that the APL2 and APL*PLUS implementations of nested 
arrays are Similar. The primary differences are in the function 
symbols chosen for the particular nested array operations. 
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PROBLEM: Construct a character nest of customer names and a matrix 
nest of unpaid bills information. 


TOPIC: Constructing Irregular Arrays 


In APL2, APL*PLUS and SHARP APL, character nests and matrix nests may 
be stored as nested arrays. In particular, they are stored as 
vectors of items, where the items are character vectors (character 
nests) or 3 column numeric matrices (matrix nests). The conventions 
for constructing these arrays are different for each of the different 
implementations of APL. 


In APL2 and APL*PLUS, the arrays may be constructed by "vector 
notation" or "strand notation": 


NAME¢’ ACME CORPORATION’ 'FASTENERS INC.!' ... 
UBILLS€(C3 3031586 372 58625 41086 395 40615 42586 407 10000) 
C1 3042086 405 80216) (0 300) ... 


Alternately, the arrays may be initialized to have the correct number 
of items, after which the items are individually index assigned 
Cassume 500 customers): 


NAMECUBILL€500e0 
NAME[11¢c’ ACME CORPORATION’ 
NAME(C2]<c’ FASTENERS INC. ’ 


UBILLS(1]¢¢3 3031586 372 58625 41086 395 40615 42586 407 
10000 

UBILLS[21¢c1 30e42086 405 80216 

UBILLS[31¢cO 300 


The monadic enclose (¢) function converts its right argument into a 
"nested scalar". In APL2, a tidier notation may be employed to 
perform the index assignment: 
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NAME¢UBILLS€50000 
(1>NAME)¢'ACME CORPORATION’ 
(2>NAME)¢'FASTENERS INC.’ 


(1>UBILLS)€3 3031586 372 58625 41086 395 40615 42586 407 
10000 

(2>UBILLS)€1 3042086 405 80216 

(3>UBILLS)€0 300 


In SHARP APL, the arrays may be constructed by repeatedly applying 
the link (>) function: 


NAME¢'ACME CORPORATION'>'FASTENERS INC.'> ... 
UBILLS€(3 3031586 372 ... 407)>(1 3942086 405 80216)> 
(0 3e0)>... 


Alternately, the arrays may be initialized to have the correct number 
of nested items, after which the items are individually index 
assigned: 


NAMECUBILLS¢500p<0 

NAMEL1]¢<'ACME CORPORATION’ 

NAME[2]¢<'FASTENERS INC.’ 

UBILLS[1]¢<3 3031586 372 58625 41086 395 40615 42586 407 
10000 

UBILLS[2]«<1 3942086 405 80216 

UBILLS[31<<0 300 


Notice that the less than (<) symbol is used to perform the enclose 
function. 


If you already have your customer names in the form of a character 
matrix (CMAT) with one row per name, you may convert the matrix 
directly into a character nest by one of the following: 


NAME¢c[2]1CMAT CAPL2 ) 
NAME¢1[2 ]CMAT CAPL*PLUS ) 
NAME¢<oc1 CMAT CSHARP APL) 


Each of the items of the resulting nest will be a character vector of 
the same length, namely the number of columns in the character matrix 
(CMAT). To delete the trailing blanks from each character vector 
item of a character nest, you must do the following: 


NAMECDTB” NAME C(APL2, APL*PLUS) 
NAME¢(+/A\' '#O0CMAT) 0 '>NAME (SHARP APL) 


where DTB (delete trailing blanks) is a user-defined monadic function 
which deletes the trailing blanks from its character vector right 
argument. For example: 
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V R¢DTB CVEC 
Cl] ReC+/aA\!' '#OCVEC) eCVEC 
V 


When generating reports which include the customer names, you may 
need to convert the character nest back into a character matrix with 
a specified number of columns. Each name must be left-justified ina 
Single row of the matrix. Suppose you want a 25 column matrix. You 
may do this as follows: 


CMAT¢>021]25T NAME CAPL2 ) 

or: CMAT€25T£2jJ>£2 JNAME CAPL2 ) 
CMAT¢+T£1.5125T “NAME CAPL* PLUS ) 
CMAT¢25To>NAME (SHARP APL) 


The second APL2 algorithm is more efficient that the first but will 
require more workspace storage than the first if any name is longer 
than 25 characters. If much longer, a WS FULL may occur. 
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PROBLEM: How can you work with irregular arrays if your 
implementation of APL does not support nested arrays? 


TOPIC: Emulating Nested Arrays on Non-Nested Systems 


Skip this section if your implementation of APL has nested arrays. 


To solve the tasks above using conventional APL, we must devise a 
scheme wherein a character nest or a matrix nest may be stored as one 
or more conventional APL arrays. We will deal first with the 
Character nest problem. 


One approach is to catenate the names together, preceding each name 
by a delimiter character. For example: 


NAME¢’ ®@ACME CORPORATION@FASTENERS INC.@KLINGLEY...!’ 


(In SHARP APL, NAME can be converted into the nested array by the 
expression: ~1¢< NAME). 


Unfortunately, this scheme does not allow an efficient means to 
directly access the name for a specified customer since the array 
must first be searched for the occurrences of the delimiter 
character. For example, the name of the fifth customer may be 
extracted with the expression: 
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1) 05=+\NAME=1o0NAME) /NAME 
which works but is inefficient. 


If the delimiter character is not going to be used to locate the 
substrings of the character nest, you may omit them altogether. Let 
us catenate the names together as a single character vector: 


NAME¢'ACME CORPORATIONFASTENERS INC. KLINGLEY...’ 


We need a second array to tell us where each name starts. One 
possibility is a Boolean "partition" vector which has one element per 
element of NAME and whose 1s mark the corresponding first elements of 
each name: 


NPV|#1 000000000000000d12000000000=~-... 


Together, these two arrays contain all the information you need to 
determine the name for a specified customer. For example, the name 
of the fifth customer may be extracted with the expression: 


(5=+\NPV) /NAME 


(In SHARP APL, NPV and NAME can be converted into the nested array by 
the expression: NPV 1°< NAME). 


Unfortunately, the partition vector approach can be awkward to work 
with and has an inherent flaw: it will not handle an empty name. 

For example, if one of the customer names is unknown, you would 
naturally store it as an empty character vector. However, the 
partition vector requires each name to have at least one element (for 
the 1 in the partition vector). 


Instead of constructing a Boolean partition vector to keep track of 
where the names start and end in the vector NAME, construct a vector 
of the lengths of each name (including Os for empty names). 


NLEN¢16 14 219 15 011... 


The sum of these elements should be the same as the number of 
elements in NAME, i.e. (+/NLEN)=oNAME. Using NLEN, you may extract 
the name of the fifth customer with the expression: 


NAMEC (C7110,+\NLEN)(5)]+UNLEN[5]] 


This expression works properly even for empty customer names Ci.e. 
for elements of NLEN which are 0). 


Note that the (°~110,+\NLEN) portion of the expression is merely 
computing the starting indices (i.e. the index before the first 
character) of the names. To improve the efficiency of the name 
extraction process, you may perform this operation once: 


NSTART¢ 140,+\NLEN 
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after which the extraction process (for customer 5) becomes: 
NAMECNSTART(5]+UNLENCE5]] 


Unfortunately, the price you pay for the greater efficiency is 
greater complexity. You must maintain 3 arrays (NAME, NLEN, NSTART) 
when working with customer names. Any utility functions you write to 
work with this character nest (e.g. to emulate the capabilities 
illustrated above on nested array systems) must cope with at least 
these 3 arguments. 


Utility function design will be greatly simplified if we can weld 
these three arrays into a single array. Unfortunately, numbers (NLEN 
and NSTART) do not cohabitate well with characters (NAME) in the 
arrays of conventional APL systems. (Note: the APL2 and APL*PLUS 
nested array implementations allow "heterogeneous" arrays which 
contain both character and numeric elements.) You must first convert 
them to a common datatype. Suppose we convert the character vector 
NAME into a numeric vector (e.g. OAVLNAME) and then catenate the 
three arrays together, along with the number of customers: 


CNEST¢(CoNLEN) ,NLEN,START,OAVLUNAME 


This array 1S a single object which contains all of the customer name 
information. Since it is a single array, you may design utility 
functions which take CNEST as one argument and the other parameters 
of the problem as the other argument. For example, you can write a 
function EXTRACT whose left argument is this "character nest" array 
and whose right argument is the index of the nest for which the 
character vector name is to be returned: 


CNEST EXTRACT 3 
KLINGLEY & SONS, INC. 


The definition of the EXTRACT function is straightforward: 


V R¢CNEST EXTRACT I;N 

ClJ] N€1TCNEST 

C2] R€EOAVCCNESTC CCNESTC1+N+1I]+1+N+N)+tCNEST£C1+1T11] 
V 


Unfortunately, this definition of a character nest is probably no 
improvement over padded character matrices. On an APL system in 
which small integers (i.e. 1 to 256) are stored with 4 bytes per 
element, each name in this character nest requires 8+4xC bytes, where 
C is the number of characters in the name. Since concern for storage 
is one of the reasons for exploring character nests, a better 
solution is needed. 


A big improvement can be made if you can manage to squeeze more than 
one character into an integer. Since integers are stored in 2 bytes 
or 4 bytes, depending upon your implementation of APL, it is possible 
to translate 2 or 4 characters into a single integer. For example, 

in APL implementations which store integers in 4 bytes, the range of 


S259 


Chapter 16 IRREGULAR ARRAYS 


integers is ~2,147,483,648 to 2,147,483,647. Beyond this range, 
integers are stored in 8 bytes. The procedure for converting 4 
characters into one integer is to convert each character into an 
integer from 1 to 256 (by finding its index in the atomic vector, 
OAV) and then pack these 4 numbers into a number between 
~“2,147,483,648 and 2,147,483,647. Given a 4 element character vector 
CVEC4, a single integer may be produced as follows: 


OoTo<o (origin O is simpler) 
INT¢!  2147483648+256i10AVULCVEC4 


To convert the integer back to a 4 element character vector: 


ODL0O¢0 
CVEC4¢OAVEI (40256) TINT+2147483648] 


Using such a packing and unpacking algorithm, each name in the 
character nest will require only 8+4xfC+4 bytes, where C is the 
number of characters in the name. This is a big improvement in 
storage but comes at the expense of processing efficiency. The 
packing and unpacking takes time. 


Some APL implementations provide primitive functions for converting 
between datatypes, e.g. for converting one 4-byte integer into 4 
1-byte characters or vice versa. Such functions are extremely 
efficient since no modification is made to the internal 
representation of the data, only to the "header" of the variable 
which indicates its shape and datatype. Such implementations are 
ideal candidates for manipulations of character nests by conventional 
means (i.e. not nested arrays). 


APL*PLUS PC is one such implementation. Integers are stored in 2 
bytes (732768 to 32767) per element. The system function ODR (data 
representation) converts between datatypes. For example, the 
expression 

NVEC¢163 ODR CVEC 


converts an N element character vector to an N+2 element integer 
vector. The expression 


CVEC¢82 ODR NVEC 


converts an M element integer vector to an Mx2 element character 
vector. 


Here is one possible design for storing character nests on an 
APL*PLUS PC system (origin 1): 
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CNEST[L1] number of character vector substrings (N) 


CNEST[2 to N+1] index (OIO=0) into this vector of the starts 
of the substrings (S) 


CNEST[S] length of this character substring (L) 


CNEST[S+1 to S+[L+2] integer representation (163 ODR) of this 
character substring, padding with 1 space if 
necessary to produce an even number of 
characters 


For example, the character nest which stores the 3 character 
substrings /TREE’, 'DOG’ and 'ELEPHANT’ is: 


CNEST¢3 4 7 10 4 21076 17733 3 20292 8263 8 19525 
20549 16712 21582 


Note that: 


163 ODR 'TREEDOG ELEPHANT’ 
21076 17733 20292 8263 19525 20549 16712 21582 


Given this design, you may write utility functions which will allow 

you to work with character nests with the same (or greater) ease and 
efficiency as when working with nested array systems. For example, 

to construct the character nest from a character matrix of customer 

names: 


NAME¢CNEST CMAT 


(The CNEST function and all other character nest utility functions 
suggested below are written for the APL*PLUS PC implementation of APL 
as exercises at the end of the chapter.) 


To convert a character nest back into a character matrix with a 
specified number of columns (say 25), use the CNAM function: 


CMAT<¢25 CNAM NAME 


We will deal now with the matrix nest problem. In one regard, the 
problem is more difficult to work with than the character nest 
problem and in another regard it is simpler. It is more difficult 
because the items are matrices instead of vectors and it is simpler 
because the values are numeric, not character, and so do not need to 
be converted into numbers. 


View the matrices as vectors (i.e. ravel them). The number of 
elements in each vector is a multiple of 3 since the matrices have 3 
columns. Viewed in this way, the nest of numeric matrices becomes a 
nest of numeric vectors, which we will call a "numeric nest". 
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One possible design for storing numeric nests is patterned after the 
design proposed above for character nests (origin 1): 


NNEST[1] number of numeric vector segments (N) 


NNEST[2 to N+1] index COIO=0) into this vector of the starts of 
the segments (S) 


NNEST[S] length of this numeric segment (L) 


NNEST(S+1 to S+L] numeric segment 


For example, the numeric nest which stores the 3 segments (30 17 15), 
(25) and (82 93 95 98) is: 


NNEST¢3 4 8 10 3 30 17 15 1 25 4 82 93 95 98 


Given this design, you may write utility functions for working with 
numeric nests. For example, to construct the numeric nest of unpaid 
bills information: 


A Number of elements per customer/segment: 

LEN¢3x3 10 2... 

A Values: 

VAL¢31586 372 58625 41086 395 40615 42586 407 10000 42086... 
A Construct numeric nest: 

UBILLS©LEN NNEST VAL 


(The NNEST function and all other numeric nest utility functions 
suggested below are written as exercises at the end of the chapter.) 


ap Ps Pr NS A A Ps NN Pe Ps PN Pp INS Pa Oe NN Re Ne Ns PD IN ON 


PROBLEM: Add to your customer information database a new customer, 
number 33, TOP DOG LTD., territory 4, which has no unpaid 
bills. 


TOPIC: Catenating to Irregular Arrays 
Use catenation to update the two regular arrays: 


ID¢ID, 33 
TERR¢TERR, 4 
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Likewise, use catenate to update the two irregular arrays if they are 
stored as nested arrays: 


NAMECNAME,¢c'TOP DOG LTD.’ CAPL2, APL*PLUS) 
NAME¢NAME,<'TOP DOG LTD.’ CSHARP APL) 
UBILLS¢UBILLS,cO 300 C(APL2, APL*PLUS) 
UBILLS¢UBILLS,<0O 300 CSHARP APL) 


The enclose function (¢ or <) must be used to construct a nested 
scalar before it is catenated to the nested array. The enclose 
function is not needed when catenating more than one customer since 
vector notation Cor the SHARP APL link function) performs the 
enclosing. For example: 


NAME¢NAME,'’TOP DOG LTD.’ 'BOTTOM CAT CORP.’ CAPL2, APL*PLUS) 
NAME¢NAME,'TOP DOG LTD. ’>'’BOTTOM CAT CORP.’ CSHARP APL) 


Using conventional APL, you may write a function CNCAT which will 
catenate a character nest to a character vector, "enclosing" the 
character vector and catenating it as a new item: 


NAME¢NAME CNCAT ‘TOP DOG LTD.’ 


If both arguments to CNCAT are character vectors, the result is a 2 
item character nest in which each item comes from a corresponding 
argument. This allows you to catenate more than one customer name at 
a time via the notation: 


NAME*CNAME CNCAT ‘TOP DOG LTD.’ CNCAT 'BOTTOM CAT CORP.’ 


Likewise, a function NNCAT may be written which will catenate two 
numeric nests. Unfortunately, the function cannot readily discern 
between an argument which is a numeric nest and an argument which is 
a Simple numeric vector. (With CNCAT, a character nest is numeric 
and a character vector is character.) One way to solve this problem 
is to write 4 functions (NNCATSS, NNCATVS, NNCATSV, NNCATVV) which 
will respectively handle the 4 possible combinations of arguments. 
For example, NNCATVS will catenate a numeric nest ("vector") left 
argument to a simple numeric vector ("scalar") right argument: 


UBILLS<¢UBILLS NNCATVS ,0 300 


You may catenate the unpaid bills information for more than one 
customer at a time via the notation: 


UBILLS¢UBILLS NNCATVV (,0 300) NNCATSS ,1 3040986 400 1000 
Another approach is to provide the "scalar" (i.e. not numeric nest) 
argument as a matrix. Then, the function can tell its numeric nest 


(vector) arguments from its non-numeric-nest (matrix) arguments: 


UBILLS©UBILLS NNCAT (0 3e0) NNCAT 1 3040986 400 1000 
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PROBLEM: Return the name and unpaid bills information for the 5th 
customer, as a character vector and a 3 column numeric 
matrix. Return the names and unpaid bills information for 
the 2nd, 8th and 12th customers, as a 3 item character nest 
and a 3 item matrix nest, respectively. 


TOPIC: Selecting from Irregular Arrays 


Using nested arrays, the 5th name and matrix of unpaid bills 
information may be returned by selecting and disclosing (converting 
from a nested scalar to a simple array): 


N¢>NAMEL5] (APL2, APL*PLUS) 
Ue>UBILLS[5] 

N¢>NAMEL[5] (SHARP APL) 
U¢>UBILLS[5] 


The select-and-disclose operation is performed so frequently when 
working with nested arrays that a specific function ("pick") is 
available in APL2 and APL*PLUS to do just that: 


N€5°>NAME CAPL2, APL*PLUS) 
U¢S>UBILLS 


To select several items from a nested vector to return a subset 
nested vector, you may use indexing directly: 


N3¢NAME[2 8 12] CAPL2, APL*PLUS, SHARP APL) 
U3¢UBILLS[2 8 12] 


Using conventional APL, you may write functions CNIDX and NNIDX which 
will extract the desired information: 


N¢NAME CNIDX 5 
U¢UBILLS NNIDX 5 
UelCCCeU)IF3),3) 00 


N3¢NAME CNIDX 2 8 12 
U3¢UBILLS NNIDX 2 8 12 


These functions return nests (character or numeric) if the right 
argument is a vector and simple arrays (character or numeric vector) 
if the right argument is a scalar. Since the unpaid bills 
information is stored as a vector, the third statement above is 
necessary to reshape the resulting vector into a 3 column matrix. 
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PROBLEM: Replace the name and unpaid bills information of the 5th 
customer by the character vector 'NEW CORP.’ and by the 3 
column numeric matrix NEWBILLS. Replace the names and 
unpaid bills information of the 2nd, 8th and 12th customers 
by those of the 4th, 5th and 6th customers. 


TOPIC: Replacing Items of Irregular Arrays 


Using nested arrays, the 5th name and matrix of unpaid bills 
information may be replaced by any of the following: 


NAME[T5J]<c'NEW CORP. ' CAPL2, APL*PLUS) 
UBILLS[5]¢€cNEWBILLS 

(5>NAME)<'NEW CORP.’ CAPL2 ) 
(5>UBILLS ) ©NEWBILLS 

NAME[5]¢<'NEW CORP. ' (SHARP APL) 
UBILLS(5]¢€<NEWBILLS 


The 2nd, 8th and 12th customers may be modified directly by indexing 
and index assignment: 


NAMEC2 8 12]¢NAME[4 5 6] CAPL2, APL*PLUS, SHARP APL) 
UBILLS[2 8 12]¢UBILLS[4 5 6] 


Using conventional APL, you may write functions CNIDXA, NNIDXA and 
ASSIGN which will replace the desired information: 


NAME¢NAME CNIDXA 5 ASSIGN 'NEW CORP.’ 
UBILLS¢UBILLS NNIDXA 5 ASSIGN ,NEWBILLS 


NAME¢NAME CNIDXA 2 8 12 ASSIGN NAME CNIDX 4 5 6 
UBILLS¢UBILLS NNIDXA 2 8 12 ASSIGN UBILLS NNIDX 4 5 6 


The function ASSIGN is a simple function provided for your 
convenience. The CNIDXA and NNIDXA functions require 3 arguments 
Coriginal nested array, indices to be replaced, simple or nested 
array to be inserted). Since APL syntax allows only 2 arguments, the 
3rd argument must be passed as a global variable. The function 
ASSIGN assigns its right argument to the global variable <assign> and 
returns its left argument as its explicit result. 


The CNIDXA and NNIDXA functions require a nest (Cin <assign>) if their 


right argument is a vector; they require a simple array if their 
right argument is a scalar. 
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PROBLEM: What is the average length CANLEN) of customer names? 
How many unpaid bills (NUBILLS) exist? 


TOPIC: Determining Shapes of Irregular Items 


Using nested arrays, you must determine the shape of each item and 
then manipulate the resulting vector accordingly: 


ANLEN¢(+/é€o0 NAME )+oNAME CAPL2 ) 
ANLEN¢ (>+/o NAME) +eNAME CAPL*PLUS ) 
ANLEN¢(+/,0°>NAME)+oNAME (SHARP APL) 


Each of these expressions requires one more step than intuition 
suggests (€, > and ,). This additional step is needed because the 
result after performing the ©” Cor os>) function is still a nested 
vector (or 1 column matrix), whose items (or rows) are each one 
element vectors, since monadic oe always returns a vector. The array 
of one element vectors must be converted into a simple vector of 
scalars. 


The solutions for the second part of the problem are similar: 


NUBILLS¢+/fT''e ‘UBILLS C(APL2) 
NUBILLS¢+/>"'9 ‘UBILLS CAPL*PLUS ) 
NUBILLS¢+/,0 ~“1lo°e>UBILLS (SHARP APL) 


Since the items of UBILLS are 3 column matrices, logic is included 
(T°, >” and ,O ~“11) to look at only the number of rows in each matrix 
(i.e. the first element of the shape of each matrix). 


Using conventional APL, you may write functions CNLEN and NNLEN which 
return a vector of the lengths of the items in the specified 
character or numeric nest: 


ANLEN¢(+/CNLEN NAME)+1TNAME 
NUBILLS¢(+/NNLEN UBILLS)+3 
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PROBLEM: Sort the database of customer information by customer name, 
alphabetically. 


TOPIC: Sorting Character Nests 


To sort the database of customer information, we need the grade-up 
vector which would put the names in alphabetically sorted order. 
Given this vector, say GRADE, the database may be reordered as 
follows: 


ID¢ID{€ GRADE] 

TERR¢TERR(E GRADE J 

NAME*CNAME[ GRADE] (nested arrays) 
UBILLS¢UBILLS [GRADE] 

NAME¢NAME CNIDX GRADE Cconventional APL) 


UBILLS<UBILLS NNIDX GRADE 


How do we determine GRADE? Using nested arrays, you must first 
convert NAME into a character matrix and then apply 4 or CGRADEUP 
(see Searching and Sorting chapter) on the character matrix. 


ALP¢’ .,;:-/ABCDEFGHIJKLMNOPORSTUVWKXYZ ’ 
GRADE¢ALP4 NAME CAPL2 ) 
GRADE*ALP4A$6$ ; NAME (SHARP APL) 
GRADE*CALP4T C>T /9 NAME) T “NAME CAPL*PLUS ) 


Using conventional APL, you may write a function CNGRADEUP which 
returns the grade vector that may be used to reorder the character 
vector items of a specified character nest into sorted order. 


GRADE©ALP CNGRADEUP NAME 


The CNGRADEUP function may be written employing techniques discussed 
in the Searching and Sorting chapter. As such, it does not need to 
convert its character nest right argument into a character matrix 
with as many columns as the longest item. In this way, the CNGRADEUP 
function is superior to the algorithms listed above for nested 

arrays. Each of these algorithms constructs a character matrix from 
the nest. If one name is very long, a WS FULL error may be generated. 
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PROBLEM: How many times does the name '/FASTENERS INC.’ occur as a 
customer name? Which customer is ‘ABC CORP.'? Which is 
'DEF CORP.'? Which is 'GHR CORP.'? 


TOPIC: Searching Character Nests 


Using nested arrays, the number of times the name ‘FASTENERS INC.’ 
occurs may be determined by one of the following: 


+/NAMEEC'PFASTENERS INC. ’ CAPL2, APL*PLUS ) 
+/NAMEé€<'FASTENERS INC.’ (SHARP APD) 


Using conventional APL, you may write a function CNEQ which compares 
each item of a specified character nest to a specified character 
vector and returns a bit vector with one element per item in the 
character nest, the 1s corresponding to items which match the 
character vector. 


+/NAME CNEQ 'FASTENERS INC.’ 


The CNEQ function may take two character nest arguments, or one 
character nest and one character vector argument (in either order) or 
two character vector arguments (returning a bit scalar). In other 
words, its behavior is parallel to that of the primitive scalar 
dyadic function equals (=). 


The second part of the problem begs for an index result. Therefore, 
dyadic iota (lt) is the logical choice. Using nested arrays, 


NAMElL’'ABC CORP.’ 'DEF CORP.’ 'GHR CORP.’ CAPL2, APL*PLUS) 
NAMEL’ABC CORP. '>'DEF CORP. '>'GHR CORP.’ (SHARP APL) 


Using conventional APL, you may write a function CNIOTA which 
searches through its character nest left argument for the first 
occurrence of each of the items in its character nest right 

argument. If the right argument is a character vector, the result is 
a scalar. 


NAME CNIOTA 'ABC CORP.’ CNCAT ‘DEF CORP.’ CNCAT 'GHR CORP. ' 
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PROBLEM: Determine the total amount of unpaid bills for each 
customer, returning a numeric vector AMT with one element 
per customer. 


TOPIC: Reducing Numeric Nests 


Using nested arrays, the "each", "on" or "with'' operators are needed: 


AMT¢3>°°+/ UBILLS CAPL2, APL*PLUS) 
AMT¢+/(€f/,0 “lle eo>UBILLS)To>,o>(<0 2)1° >UBILLS (SHARP APL) 


Using conventional APL, you may write a function NNSUMCOL which will 
sum the Nth column of each M-column matrix item stored as a numeric 
vector item in the specified numeric nest. 


AMT<¢UBILLS NNSUMCOL 3 3 


The first 3 in the right argument of NNSUMCOL is the number of 
columns in the raveled matrix items. The second 3 is the index of 
the column to be summed. 
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PROBLEMS: CSolutions on pages 459 to 473) 


1. Suppose you have a numeric nest (vector of numeric vectors) named 
SALES which contains one item per customer and for which each 
item is a vector of the customer’s monthly sales. The length of 
each vector is a function of how long the customer has been 
generating sales. What expression will produce a vector (AVG) 
with one element per customer, where each element is the average 
monthly sales for the corresponding customer? 


2. Given a character nest NAMES, what expression will return a 
character nest of its distinct (unique) items (UNAMES)? 
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3. Write one or more of the character nest functions mentioned in 
this chapter for your implementation of APL (if it does not have 
nested arrays). The functions are: CNEST, CNAM, CNCAT, CNIDX, 
CNIDXA, ASSIGN, CNLEN, CNGRADE, CNEQ, CNIOTA. 


4. Write one or more of the numeric nest functions mentioned in this 
chapter if your implementation of APL does not have nested 
arrays. The functions are: NNEST, NNCATSS, NNCATVS, NNCATSV, 
NNCATVV, NNCAT, NNIDX, NNIDXA, ASSIGN, NNLEN, NNSUMCOL. 
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CURVE FITTING 


In both the business and scientific disciplines, the need 
occasionally arises to fit mathematical curves to empirical data. 
The process of curve fitting is a wonderful, magical process. Like 
most magic, curve fitting is illusion based upon reality. The 
reality is that rigorous mathematical algorithms are applied by the 
computer to determine the precise parameters of a "best" curve. The 
illusion is that this "best" curve somehow possesses more knowledge 
than the data: that it can be used to predict the future. This can 
be a dangerous and foolhardy view. 


This is not to say that curve fitting is useless. Far from it. 
Curve fitting is not only useful; it is fun. Just be careful that 
you are not lured into thinking that the computer’s intuition is 
better than your own. 


In this chapter, we discuss the most complex of all the APL primitive 
functions, quad-divide (H). The chapter does not presume that you 
understand the concepts of numerical analysis and linear algebra 
needed to appreciate the algorithms applied within quad-divide. 
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PROBLEM: Given the following sample data and the function of a 
curve, find the coefficients of the curve. 


T xX Y 2 


Curve: T=C(CAXX)I+(BxXY)+0CxZ) 


Problem: find scalars A, B, Cc 


2 
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TOPIC: Using Quad-Divide 


To provide some meaning to this example, suppose the three 
observations represent three salespeople. The first salesperson 
received no base salary (X) last year, was paid $3 in stock options 
(Y) and received no bonuses (Z). He generated 9 new customers (T). 
The second salesperson received $1 in base salary, $2 in stock 
options, $3 in bonuses and generated 20 new customers. The third 
salesperson received $4 in base salary, #1 in stock options, #2 in 
bonuses and generated 19 new customers. 


We have a hypothesis that the number of new customers generated by a 
salesperson (T) is a direct function of the base salary (X), stock 
options (Y) and bonuses (Z), and that the function is in the form: 


T= (AXX)+CBxY)+(€CxZ) 


If we can determine the constants (coefficients) A, B and C in this 
formula, we can predict the number of new customers (T) which will be 
generated by a salesperson who is paid a specified base salary, stock 
option amount and bonus amount. Since we have three equations 
(9=3xB; 20=A+(2xB)+(3xC); 19=(4xA)+B+(2xC)) in three unknowns (A, B, 
C), the problem can be solved by algebraic manipulations and 
substitutions. After tedious work, we discover that A=2, B=3, C=4. 


Quad-divide (H) can be used to solve this problem directly. Provide 
the dependent values (T) as its vector left argument, and the 
independent values (X, Y, Z) as its 3 column matrix right argument. 
The result is a vector of the three desired constants (coefficients). 


T¢9 20 19 
XYZ¢3 300 30123412 
C+THXYZ 
Cc 
234 


Once the coefficients are known, it becomes a simple matter to apply 
the formula to hypothetical values. For example, how many new 
customers do we expect to be generated by a salesperson who is paid 0 
in base salary, #3 in stock options and $4 in bonuses? 


0 3 4+.xC A €COxA)I+03XB)+C4xC) 
25 


In order to solve problems like these (systems of linear equations), 
the data must satisfy certain requirements. For one thing, we must 
provide one equation (observation) for each unknown. Since there 
were three unknown constants (A, B, C) in the above example, we 
needed three observations. If we do not provide enough observations, 
there are an infinite number of solutions (values of A, B, C) which 
will satisfy the specified observations. Quad-divide will generate a 
DOMAIN ERROR. 
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Second, even if we provide one observation per unknown, there may be 
insufficient data to determine a unique solution. For example, in 
the following set of data, the third observation provides the same 
information as the second. The values are simply doubled. 


No amount of algebraic manipulation will produce a unique solution. 
To have a unique solution, each observation must be independent. 
That is, no observation may be a linear combination of the other 
observations. 


If observations are not independent, quad-divide will generate a 
DOMAIN ERROR. The matrix right argument is said to be "singular" and 
cannot be "inverted". No unique solution exists. 


Third, if we provide more observations that there are unknowns, there 
will likely be contradictory data so that no single set of 
coefficients will satisfy all of the observations. In such an 
instance, the best we can hope for is a set of coefficients which 
defines a formula which is reasonably accurate for all of the 
observations, though it may not be precise for any of them. 


This 1s, in fact, what quad-divide returns. It does so by using the 
method of least squares. Simply stated, it returns those 
coefficients which when applied against the independent values will 
return a dependent value (the "expected") which is as close as 
possible to the specified dependent value (the "actual'"). One 
measure of "noncloseness" (called the "mean squared error") is the 
mean of the squares of the differences between each of the actuals 
and their corresponding expecteds. The algorithm within quad-divide 
determines the set of coefficients which produces the smallest value 
of this noncloseness measure (hence, "least squares"). 


To illustrate this "best fit" behavior of quad-divide, let us add one 
more salesperson to our example and then determine the coefficients: 
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Teg. 20. 19 IY 
XY¥Z€4 3c0 3 01234122221 
CeTHXYZ 
Cc 
1.6667 2.6667 4.3333 


We may determine the expecteds by applying the computed coefficients 
against the matrix of independent values: 


BEXYZ+. XC 
E 
8 20 18 13 


While only one of these values exactly matches the actuals (T), the 
other values are reasonably close. The mean squared error is: 


((E-T)+.*2)+0E 
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PROBLEM: During the last 5 years, the annual sales figures for your 
firm have been 27, 29, 35, 41, 44. If you assume that 
sales are growing at a linear rate, what do you project 
sales to be next year? 


TOPIC: Forecasting 


Forecasting is just curve fitting in disguise. Instead of expressing 
a formula in terms of several independent variables (such as X, Y, 
Z}, we express it in terms of time (T). In this problem, we are 
searching for a formula which looks like: 


SALES =A+(BxT ) 


where T is some measure of time (such as 1, 2, 3, 4, 5 for the last 5 
years) and A and B are the constants to be determined. 


If we express our data in the same form as the data in the previous 
section, the solution becomes obvious. 
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The column of ls is used to indicate that the first constant (A) is 
multiplied by 1 in each observation (i.e. in SALES=A+(BxT)). 


MAT¢5 201112131415 
SALES€27 29 35 41 44 
C+SALESHMAT 


Cc 
21.4 4.6 


The two elements in C are our constants A and B. To project the 
sales for next year, we need only plug the time 6 into our formula: 


1 6+.XC 
49 


If we wish to see how closely our formula tracks the past five years, 
we can apply the constants to the time periods 1 to 5: 


MAT+.xC 
26 30.6 35.2 39.8 44.4 


We can compare these values to the actual sales (27, 29, 35, 41, 44). 


Suppose we assume that sales are growing at a quadratic rate instead 
of a linear rate. The formula instead looks like: 


SALES =A+ (BxT)+CCxTx2 ) 


Our data can be expressed in an expanded table: 
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The solution is no more complex: 


MAT¢€(1t5)0.*0 1 2 
SALES€27 29 35 41 44 
C¢SALESHMAT 
Cc 

22.4 3.742857 .142857 


Nor is the projection: 


1 6 36+.xC 
50 


Nor is the tracking: 


MAT+.xC 
26.2857 30.4571 34.9143 39.6571 44.6857 


Notice that a quadratic formula tracks better than a linear one. 

This in no surprise since the linear formula is just a special case 
of the quadratic formula (the case in which the coefficient of the 
squared term is 0). If the line fits better than any other quadratic 
form, the third coefficient will be zero. 


After realizing that a quadratic formula fits better than a linear 
formula, we may jump to the conclusion that:'a cubic formula (e.g. 
SALES=A+(BxT)+(CxT42)+(CxT*3)) fits better still. And it does. 
However, we must bear two thoughts in mind before carrying this logic 
to its extreme. The first is that we must provide at least one 
observation per unknown constant. Otherwise, quad-divide will 
generate a DOMAIN ERROR because there are an infinite number of 
possible solutions. Hence, for our 5 observations (years), the most 
terms we can have in our formula is 5. 


The second thing to consider is that the reason a higer-order formula 
fits better to the data is that it exhibits more helter-skelter 
behavior than a lower-order formula. It has more mood swings. Thus, 
the formula which is selected by quad-divide to perfectly match your 
5 observations may send you into outer space when you project the 
sixth period. 


The bottom line in this discussion is that despite the difficulty of 
the number-crunching task faced by the computer, your task is more 
difficult. It is your job, after all, to determine which formula you 
think your data fits. There are an infinity of possible formulas. 
The computer merely has to crunch out coefficients. 
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PROBLEM: As in the previous section, we wish to project forward one 
time period the sales figures from the past 5 years (27, 
29, 35, 41, 44). We believe the data conforms to the 
formula, SALES=+(CA+(CBxT)). 


TOPIC: Fitting Data to a Nonlinear Formula 


Our problem here is that we need to determine two constants (A and B) 
but the formula is not expressed as an additive combination of two 
terms, each multiplied by one of the constants. We must transform 
the formula so that it is ina form suitable to quad-divide. 


If we multiply both sides of the formula by A+(BxT), we get: 
1=(SALESxXA)+(CSALESXBxT ) 
From this linear form, we can solve directly for A and B: 


SALES€27 29 35 41 44 
TIME¢.5 
MAT<«SALES, [1.5 1SALESxTIME 
C+( (pSALES)01)HMAT 
Cc 

.040625 ~3.7567 


The projection follows directly: 


+C+. x1 6 
55.295 


While we have accomplished our projection via this transformation, we 
must concede that quad-divide provides us the best coefficients for 
the transformed formula, not for the original formula. Hopefully, 
this distinction is not important. However, we must bear it in mind 
whenever we transform a formula. 


Let us try fitting the data to another formula which is nonlinear and 
requires a transformation. 


Formula: SALES=Ax*BxT 


Transformed formula: (®SALES )=(@A)+(CBxT) 


Solution: MAT¢TIME°o.*0 1 
C¢( @SALES )HMAT A Returns: (@®A) and B 
(*C£L11])x*CL2]x6 
51.426 
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PROBLEM: In the previous sections, we use four different formulas to 
project 5 years of sales figures one year: 


- SALES=A+tCBxT) 

SALES =A+ (BxT)+(CCxT*2 ) 
»- SALES=+(CA+CBxT) ) 
SALES =AX*BxT 


PWN 


For each of these formulas, we obtain a different set of 
coefficients and a different projection (sales figure for 
the sixth year). Which projection is the best? 


TOPIC: Finding the Best Formula 


The best projection is that which exactly matches the actual. 
Unfortunately, the actual sixth year sales figure is not yet known. 
Therefore, we must determine the best projection in another way. 
Let's choose the projection that comes from the formula which most 
closely fits the data. (Bear in mind that the best fitting curve is 
not necessarily the best projector of the data.) 


Which formula fits the best? Just as the mean squared error is used 
to determine the best coefficients for a given formula, so too can it 
be used to determine the best formula for a given set of data. The 
formula which produces the least mean squared error is the best 
fitting formula. Remember: the mean squared error is the mean of 
the squares of the differences between each of the actuals and thier 
corresponding expecteds. 


Let us compute the mean squared error for each formula: 
SALES€¢27 29 35 41 44 
TIME¢1 23 4 5 
1. SALES=A+(BxT) 
MAT¢TIMEoO.*O 1 
C¢SALESHMAT 
EXP¢MAT+. xC 


( (SALES-EXP)+.*2)+oSALES 
1.04 
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2. SALES=A+(CBxT) + CCxTx2 ) 


MAT¢TIMEo.*0 1 2 

C¢SALESHMAT 

EXP¢MAT+. xC 

((SALES-EXP)+.*2)+oSALES 
0.983 


3. SALES=+(CA+CBxT) ) 


MAT¢SALES ,[1.5]SALESXTIME 

C¢( (eSALES)e1)HMAT 

EXP¢+(TIMEo.*0 1)+.xC 

( (SALES-EXP)+.*2)+oSALES 
1.85 


4. SALES=AX*BxT 


MAT¢TIMEoO.*0 1 

C¢« (@®SALES ) HMAT 

EXP¢(*Cl011)x*«CL2)]xXTIME 

((SALES-EXP)+.*2)+eSALES 
1.10 


The smallest mean squared error for the above formulas is the one for 
the second formula. Therefore, the second formula is the "best" 
formula (of the four selected) and the best projection is the one 
from that formula (50). 
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PROBLEMS : (Solutions on pages 474 to 475) 


1. Each month, you buy 4 styles of envelopes from your paper 
supplier (styles A, B, C, D). The supplier has never bothered to 
itemize the different styles on the invoice. The invoice shows a 
total amount only. Suppose you bought the following quantities 
of envelopes during the previous 4 months: 


Total 
A B 6: D Invoice 
32 61 15 82 $60.62 
35 104 10 82 $59.57 
37 83 5 85 $56.70 
25 62 14 85 $60.42 
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Assuming prices have remained constant during this period and 
that the invoice does not include volume discounts or other 
credit or debit adjustments, what are the unit prices for the 4 
styles of envelopes? 


During the last 2 months, you have been making weekly checks of 
your inventory of muffler bearings. The results are given here: 


Week: 1 2 3 4 5 6 7 8 9 
On hand: 1850 1772 1705 ---- 1508 1490 ---- ---- 1250 


(During week 4 the plant was closed and during weeks 7 and 8 you 
were on vacation.) Assuming the supply of bearings is being 
depleted at a steady (linear) rate, when will the supply be 
exhausted (assuming no restocking)? 


What is the radius and center of the circle which best fits the 
points (4,1), (3,2), (2,3), (4,5), (7,2), (6,4), i.e. for which 
X¢4 3 24 76 and Y¢1 23 5 2 4? The general formula for a 
circle is: 


CR*e2) = CCX-CX)*2) + COY-CY) 42) 


where R is the radius, and (CX,CY) is the center. 


In the Sorting and Searching chapter, a function CMIOTA is 
presented for searching through the rows of one character matrix 
for the location of the rows of a second. The function is 
designed to use one of two different algorithms depending upon 
the number of rows in its arguments. In the Computer Efficiency 
Considerations chapter, the two algorithms are timed for a 
variety of different size arguments (in a problem at the end of 
that chapter). Suppose you have constructed 4 vectors which 
contain the results of those timings: 


L: The number of rows for left arguments; 
R: The number of rows for right arguments; 
Tl: The average time required to run CMIOTA using the first 


Csorting) algorithm for a left argument with L rows and a 
right argument with R rows; 
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T2: The average time required to run CMIOTA using the second 
(looping) algorithm for a left argument with L rows and a 
right argument with R rows. 


According to the logic in the Sorting and Searching chapter, the 
vectors are related according to the approximate formulas: 


Tl 
T2 


C4+(C5xXCR+L) ) 
C1+ CRxX(C2+(0C3xL))) 


WA 


Determine the values for Cl, C2, C3, C4 and C5. You may replace 
the like-named variables in the CMIOTA function so that it will 
use the fastest algorithm for any combination of arguments in 
your APL environment. 
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FINANCIAL UTILITIES 


In a variety of business oriented computer applications, 
sophisticated financial calculations are required. Many of these 
calculations deal with the time-value-of-money concept, i.e. the 
concept of interest. In this chapter, we develop utilities which 
handle some of the more common financial calculation requirements. 


This chapter does not provide a comprehensive treatment of the theory 
of interest or of financial analysis. It provides only as much 
material as is needed for you to grasp the important concepts. Nor 
does it provide a comprehensive library of financial software. [It 
provides some generally useful utility functions. 
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PROBLEM: You are considering the purchase of a new machine. The 
amount you must borrow from the bank is $225,000 to be 
repaid in monthly installments at 13% interest over 10 
years. What will be the monthly payment? 


TOPIC: Interest and Annuities 


We will work toward the solution to this problem gradually. 


Suppose you deposit $100 in your bank account at the beginning of the 
year. If at the end of the year your balance is $112, you have 
earned $12 on your #100 deposit and so your "effective" (annual or 
APR) interest rate is .12 (12+100). At the end of two years (if 
interest rates remain the same), you will have $125.44 (#112 + 
$112x.12 or $112x1.12). After three years, you will have $140.49 
($125.44x1.12). And so on. In general, after N years, you will have 
$100x1.12*N. The factor 1.12 (1 plus the interest rate) is called 
the annual "accumulation factor". 
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Suppose we look at your banking situation in a reverse way, from the 
present back into the past. If you have a $100 balance today, how 
much must you have had in the account one year ago? If we call the 
desired amount AMT, we know from the logic above that AMTx1.12 = 
$100. Therefore, AMT = $100+1.12 = $89.28. Two years ago, you had 
$79.72 ($89.28+1.12). And so on. In general, N years ago, you had 
$100¢1.124N. The factor +1.12 (the reciprocal of the annual 
accumulation factor) is called the annual "discount factor"). 


If instead of paying you 12% per year, the bank offers to pay you 1% 
per month, how much money will you have at the end of the year? 
Since the monthly accumulation factor is 1.01, you will have $101 
($100x1.01) after one month, $102.01 (#101x1.01) after two months, 
$103.03 (#$102.01x1.01) after three months, ... and $112.68 
($111.57x1.01 or $100x1.01%*12) after twelve months. Since you have 
earned $12.68 on your $100 deposit, your effective interest rate is 
.1268. This rate may also be called a "nominal" rate of .12, 
compounded monthly (€.12 = .01x12 months). The term "compounded" (Cor 
converted) refers to the periodic process of applying the 
accumulation factor to the balance to derive the new balance. 


Since an interest rate may be expressed in "effective" or "nominal" 
terms, it is important to understand the distinction between the two 
and to be able to convert between them. The relationship between 
them follows: 


C1+EINT) = C1+NINT+CONV) «CONV 


where: 
EINT = effective interest rate (e.g. .1268) 
NINT = nominal interest rate (e.g. .12) 
CONV = number of interest conversion (compounding) periods 
per year 


It is instructive to note that for a given nominal interest rate, the 
equivalent effective interest rate increases as the conversion 
frequency increases. That is, the more frequently the bank compounds 
your nominal interest, the more you will have at the end of the 

year. However, the amount of increase in the effective interest rate 
drops off as the conversion frequency gets bigger and bigger. [In 
fact, it makes little difference whether you compound weekly or every 
second. This brings up the concept of "continuous" compounding. 

That is, by using a bigger and bigger value of CONV in the formula 
above, the value of (1+EINT) will eventually Cwhen CONV becomes 
infinitely large) become constant. This value (called the "force of 
interest") may be computed by using a large value of CONV (say, 
10000) or by using the following formula (which may be derived 
mathematically): 


C1+EINT) = *NINT (when CONV is infinitely large) 
Because of marketing appeal or because of the mathematical 


simplicity, financial institutions occasionally quote interest rates 
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"compounded continuously". You may convert them to equivalent 
effective rates by applying the above formula. 


The following functions may be used to convert between nominal and 
effective interest rates. 


C[WSID: INTEREST] 
V E¢C EFFECTIVE N;R3S;T 
Converts the nominal interest rates in N, 
compounded C times per year, into effective 
(annual) interest rates. A value of ~1 for 
C implies continuous compounding. C and N 
may have any shapes as long as they are 
scalar conformable. 


ra 

1 

Lt 
DDDDDDOD 


Determine shape of result: 

[9] S¢oC+N 

[10] A Construct all zero raveled result: 

[11] RepE€(x/S)e0 

[12] aA Convert arguments to same-length vectors: 
[13] Cé€Rpec 

[14] N¢RoN 

[15] a Select rates with noncontinuous compounding: 
[16] Tecf'1 

[17] ECT/veTIJ€¢~1+0€1+CT/NI+T/C)*T/C 

[18] a Select continuous rates: 

C19] TeT 

[20] ELT/itpT]¢71+*T/N 

[21] A Reshape result: 

C22] E¢SoeE 
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C[WSID: INTEREST] 
V N¢eC NOMINAL E;R3S;T 
Converts the effective (annual) interest rates 
in E into nominal interest rates compounded C 
times per year. A value of “1 for C implies 
continuous compounding. cC and E may have any 
shapes as long as they are scalar conformable. 


ent 

fr 

| oe | 
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Determine shape of result: 
C8] S¢eCtE 
[9] aA Construct all zero raveled result: 
[10] ReeN€¢Cx/S)e0 
[11] A Convert arguments to same-length vectors: 
C12] C€Rec 
[13] E¢RoE 
[14] aA Select rates with noncontinuous compounding: 
C15] Tec# 1 
[16] NEIT/rveTI]¢CT/C)x-14+€14+T/E)*+T/C 
[17] a Select continuous rates: 
[18] TeT 
C19] NUT/i.epTI<@1+T/E 
[20] A Reshape result: 
[21] N€SoN 
V 


Let us extend our discussion to include annuities. An "annuity" is a 
regular series of payments. Suppose, for example, you plan to 
deposit #100 in your bank account at the beginning of each year for 
the next 10 years. If the effective interest rate is .12, what will 
be your balance at the end of 10 years? 


The first payment will have compounded 10 times, the second 9 times, 
and so on. The sum of these amounts is the balance: 


BALANCE = (€100x1.12*10) + €100X1.12*9) +...+ (€100x1.12*1) 


By dividing each side of this equation by 1.12 to get a second 
equation, by subtracting the second equation from the first equation 
and by algebraic manipulation, we discover that: 


BALANCE = 100x1.12x(€(€1.12*«10)-1)+.12 


By performing the calculations, we learn that the "future value" of 
this 10 year annuity is $1,965.46. 


Suppose we look at a 10 year annuity in a reverse way, from the 
beginning of the annuity rather than the end. What is the value 
today (the "present value") of an annuity in which you deposit $100 
at the beginning of each year for the next 10 years? That is, what 
single amount can you deposit today that will compound to $1,965.46 
at the end of 10 years, i.e. that will result in the same future 
value as will the annuity? 
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Tf PV (present value) is the amount we are seeking, the equation we 
need to solve is: 


(PVx1.12*10) = 100xX1.12x((C1.12*10)-1)+.12 
from which we get: 
PV = 100x1.12x(1-1.12%°10)+.12 


By performing the calculations, we learn that the present value of 
this 10 year annuity is $632.83. That is, a single payment today of 
$632.83 is equivalent to a 10 year annuity of #100 starting today. 
Each will compound to $1,965.46 at the end of 10 years. 


Having presented some examples to expose you to the concepts of 
compounding, annuities and present value, let us now take a leap 
forward and work with more general annuities. First, some 
definitions: 


TERM: the number of years during which the annuity is paid; 
PAY: the annual payment amount; 
PER: the number of payments per year (PAY+PER per payment); 


PDEF: the fraction of a payment period preceding (deferring) 
each payment; O if the payment occurs at the beginning of 
the payment period and 1 if at the end; 


DEF: the number of years from the valuation date to the first 
period; 0 if the present value of the annuity is desired 
and -TERM if the future value is desired; 


CONV: the number of interest conversion (compounding) periods 
per year; 


INT: the nominal annual interest rate. 


We will illustrate these terms by depicting a sample annuity on a 
"time" diagram. Below is the time diagram of an annuity for which 
the present value is desired. The annuity will be paid for 3 years 
(TERM=3), will be $200 per year (PAY=200) paid semiannually (PER=2) 
at the end (PDEF=1) of each half year payment period and will 
commence after 1 year (DEF=1) without payments. The present value is 
to be computed at a nominal interest rate of 12% CINT=.12) compounded 
monthly CCONV=12). 


-286- 


Chapter 18 FINANCIAL UTILITIES 
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The general formula for the value of an annuity is: 
VAL = PAYX(V*DEF+PDEF+PER) X(C1-V*«TERM)+PERX1-V*+PER 
where: 


VAL: the value (present, future or otherwise) of an annuity 
as of DEF years before the first payment period; 


V: the annual discount factor, computed from INT and CONV, 
V = C1+INT+CONV) *-CONV 


Some modifications to this formula are needed for some possible 
extreme conditions. First, if TERM is infinite (such an annuity is 
called a "perpetuity"), the V*ATERM term of the formula becomes 0 and 
may be omitted. Second, if CONV is infinite (continuous interest 
compounding), the definition of V becomes: 


Vo = *-INT 


Third, if PER is infinite C"continuous payments", a theoretical 
concept), the PDEF+PER term of the formula becomes O and may be 
omitted, and the PERxX1-V*+PER term becomes -®V. 


Given this formula and these possible modifications, a niladic 
function VALUE can be written which computes the value of an annuity 
given all of its parameters as global variables. The global 
variables have the same names as the parameters defined above except 
the first letter is from the alternate character set (tERM, pAY, pER, 
PDEF, dEF, cONV, iNT). Infinite values are represented by the value 
“1 (for tERM, pER or cONV). The global variables may have any shape 
as long as they are all scalar conformable with one another, that is 
as long as the expression, 


tERM+pAY+pER+pDEF+dEF+cONV+iNnT 
does not generate a RANK ERROR or a LENGTH ERROR. 


All of the global values Cexcept iNT) have default values which are 
used if that global variable does not exist. 
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CWSID: INTEREST] 
V VAL¢VALUE;CONV;:DEF;E;INT;N;NDPP; PAY; PD; PDEF;PER;R; 
SHAPE ; TERM; V 
[1] aA Returns the present or future value of a stream 
[2] a of uniform cash flows defined by the optional 
[3] aA global variables: 
[4] aA 
[5] a default name description 
P64! oh) - = Ses— Sa SSS SS SS ae eS ee Sa eS See 
[7] A 1 tERM no. years of payments (”1l=perpetuity) 
[8] aA 1 pAY annual payment amount 
[9] A 1 pER no. payments per year (71=continuous) 
[10] a 1 pDEF is payment at start (0) or end (1) of 
[11] a payment period Cor fractional) 
[12] aA 0 GdEF no. years from valuation date to 
[13] a first payment period 
[14] a 1 cONV no. interest conversion (compounding) 
[15] a periods per year (~1=continuous) 
[16] A = iNT nominal annual interest rate (or force 
[17] a of interest if cONV="1) 
C18] A 
[19] A Determine values from globals or defaults: 
[20] E¢xONC Ne’ tERM’ 
[21] TERM¢?CE/N),(C~E)/'1' 
[22] Ee«xONC Ne'pay’ 
[23] PAY¢?CE/N),(C~+E)/'1' 
[24] EexONC Ne'pER’ 
C25] PER¢@CE/N),(~E)/'1’ 
[26] EexONC Ne’ pDEF’ 
C27] PDEF¢?CE/N),(C~E)/'1' 
C28] E¢xONC Ne'dEF’ 
[29] DEF¢?(E/N),(~E)/'0’ 
[30] E¢xONC Ne’coONnv’ 
C31] CONVe?CE/N),(C~E)/'1' 
C32] INT¢iNT 
C33] A 
C34] A Construct result as a vector; reshape when done: 
C35] SHAPE¢eTERM+PAY+PER+PDEF+DEF+INT+CONV 
[36] R¢x/SHAPE 
[37] TERM¢Ro TERM 
[38] PAY¢RoPAY 
[39] PER¢RoePER 
[40] PDEF¢RoPDEF 
[41] DEF¢RoDEF 
[42] INT¢RpINT 
[43] CONV+ReCONV 
[44] a Convert interest to annual discount factor: 
[45] V€¢Re0 
[46] A Noncontinuous compounding: 
[47] E€¢CONv#'1 
C48] VLIE/teVI€C1+CE/INT)+E/CONV) *-E/CONV 
[49] A Continuous compounding: 
[50] Ee-E 
[51] VLE/vieV1¢*-E/INT 
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V VALUE (continued) 
[52] A 
[53] aA Period deferral as a fraction of a year: 
[54] E¢PER#°1 
[55] PD¢EXPDEF+PER+~E 
[561 A 
[571 A Determine nominal Cby payment period) discount: 
[58] NDPP<«Roe0 
[59] aA Noncontinuous payments: 
[60] NDPPLE/teNDPPIJ<¢CE/PER)X1-CE/V)*+E/PER 
[61] A Continuous payments: 


C62] EervE 
[63] NDPPLE/tepNDPPIi¢-®@E/V 
C64] A 


[65] A Value at valuation date: 

[66] E¢TERM# 1 

C67] VAL©SHAPEe PAYX (V*PD+DEF) x(1-ExV*EXTERM)=+NDPP 
V 


Using the VALUE function, you can now answer the problem which began 
this section. The repayment schedule of a loan is simply an 
annuity. The present value of that annuity is exactly equal to the 
amount borrowed from the bank. We know the following: 


tERM¢10 

pER¢12 

pDEF¢1 

dEF<¢O 

cCONV¢12 (probably, but ask the bank) 
iNT¢.13 


The only unknown parameter is pAY, whose value we seek. However, we 
know the present value (225000) which will be the result of VALUE if 
we provide the correct value of pAY. We may use VALUE iteratively, 
modifying the value of pAY in a trial and error fashion until the 
result of VALUE is 225000. Alternatively, since we know that the 
result of VALUE is directly proportional to the value of pAY, we may 
arbitrarily set pAY to 12 (i.e. 1 per month) and divide the result of 
VALUE into 225000 to determine the factor by which pAY must be 
multiplied to produce the desired present value. 


pAY€12 


225000+VALUE 
3359.49 


This is the monthly payment. 


If you wish to experiment with the rate of interest to determine its 
effect upon the monthly payment, you may do it as follows: 


iNT€.10 .11 .12 .13 .14 .15 


225000+VALUE 
2973.39 3099.38 3228.1 3359.49 3493.49 3630.04 
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To produce a table of the monthly payments for each of these six 
interest rates, for 5, 10, 15 and 20 years, you may do it as follows: 


INT€.10 .11 .12 .13 .14 .15 o.+ 000 0 
tERM¢O 00000 °.+ 5 10 15 20 


9 26225000+VALUE 


4780.59 2973.39 2417.86 2171.30 
4892.05 3099.38 2557.34 2322.42 
5005.00 3228.10 2700.38 2477.44 
5119.44 3359.49 2846.79 2636.05 
5235.36 3493.49 2996.42 2797.92 
5352.73 3630.04 3149.07 2962.78 


Be Bs Ps A PI AS A Pa A, Pa Bs A INS Aw NS A A Po As A A Au NS PA PN MA A 


Having borrowed $225,000 from the bank to be repaid in 
monthly installments at 13% interest over 10 years, how 
much will you have paid after one year? Of that amount, 

how much will have gone toward payment of interest and how 
much toward repayment of the original principal of #225,000? 


PROBLEM: 


TOPIC: Loan Amortization Schedules 


From the function presented in the prior section, we can answer the 
first part of the problem. 


tERM¢10 

pER¢12 

pDEF¢1 

dEF<0 

CONV€12 

iNT¢.13 

pAY¢1 

O<¢ANNUAL€225000+VALUE 
40313.9 


This annual loan repayment consists of an interest portion and a 
principal repayment portion. To determine the two portions, we need 
only determine the remaining principal at the end of the year. The 
difference between the original principal (225,000) and the 
remaining principal is the amount of principal repaid. The 
difference between the total annual payment ($40,313.90) and the 
principal repaid is the interest paid. 


So how do we determine the remaining principal at the end of the 
year? The principal is the present value of the remaining payments. 
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That is, the amount you owe the bank at any given time (the 
principal) is equivalent to the value at that time of the remaining 
payments. For this reason, the bank is indifferent (theoretically) 
to whether you repay the principal of the loan or continue to make 
the loan payments. 


The principal at the end of the year may be computed by the following: 


tERM¢9 

pAY<ANNUAL 

O+REMAINING¢VALUE 
213252.48 


The principal repaid and interest paid follow directly: 


O+PRINCIPAL€225000-REMAINING 
11747.52 

O-INTEREST¢ANNUAL- PRINCIPAL 
28566. 38 


By applying the logic in this section, we may write an APL function 
which will generate a "loan amortization table". A loan amortization 
table shows the breakout of each loan payment into its interest and 
principal repayment portions. The monadic function SCHEDULE takes a 
6 element vector right argument which defines the parameters of the 
loan: 


[1] TERM: the number of years of loan repayment; 
[2] PAY: the annual repayment amount; 
[3] PER: the number of payments per year; 


[4] DEF: the number of years from the loan to the first 
repayment period; 


[5] CONV: the number of interest conversion (compounding) 
periods per year; 


[6] INT: the nominal interest rate. 
The result of SCHEDULE is a two column matrix of the loan 
amortization table. The matrix has one row per loan payment 
(TERMXPER). The values in the first column are the portions of each 
payment which represent principal repayment. The values in the 
second column are the portions which represent interest payments. 
We may solve the problem above as follows: 

MAT¢SCHEDULE 10,ANNUAL,12 0 12 .13 

where ANNUAL was computed above by the VALUE function. MAT has 120 


rows Cone row per loan payment). The principal repaid and interest 
paid during the first year are: 
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11747.52 28566.38 


FINANCIAL UTILITIES 


There are two identities about the loan amortization table which 


should be 


noted. 


1 


(+/MAT)A.=ANNUAL+12 


The sum of each row is the periodic payment amount: 


The total of the principal repayment column is the original loan 


amount: 


[25] 
[26] 
[27] 
C28] 
[29] 
[30] 
C31] 
[32] 
C33] 
[34] 
C35] 


225000=+/MATE 51] 


TERM;V; VAL 


row per payment: 


MATE 31] 
MATE 52] 


PARAMS[ 1] 
PARAMS [2 ] 
PARAMS [3 ] 
PARAMS [ 4 ] 


TERM 
PAY 
PER 
DEF 


PARAMSE5] CONV 


PARAMS[6] INT 


DDDDIDIADIDIDID DID IDIAAI VID IDII DIO OO 


TERM¢PARAMS[ 1] 
PAY¢PARAMS[ 2 ] 
PER¢PARAMS[ 3 ] 
DEF¢PARAMS[ 4] 
CONV¢PARAMS[5 J 
INT¢PARAMS [6] 


[WSID: INTEREST} 


V MAT<¢SCHEDULE PARAMS ;CONV; DEF;INT; INTER; PAY; PER; PRIN; 


Returns a two column matrix of the loan payment 
schedule for the loan defined by the parameters 
in the vector right argument. 


The result has one 


principal repaid 
interest paid 


The parameters in the argument are: 


no. years of payments 

annual payment amount 

no. payments per year 

no. years from date of loan to 
first payment period (0=no 
repayment deferral) 

no. interest conversion 
(compounding) periods per 
year (7~1=continuous) 

nominal annual interest rate 
Cor force of interest if 
CONV=" 1) 


A Convert interest to annual discount factor: 
A Branch if noncontinuous compounding: 


>(CONV4 “1)eL1 


A Continuous compounding: 


Vex*x-INT 
>L2 
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VY SCHEDULE (continued) 
[36] aA Noncontinuous compounding: 
[37] L1:V¢C1+INT+CONV) *-CONV 
[38] A Determine present value (principal balance) at 
[39] A start of each payment period: 
[40] L2:VAL¢( PAYXV*+PER) x€1-V* COLTERMXPER) +PER)+PERX1-V*+PER 
[41] aA Reset first value to loan amount if repayment 
[42] aA is deferred: 
[43] VAL(C1LI¢VALC11]xV*DEF 
[44] a Principal repaid: 
[45] PRIN¢VAL-1/VAL,0O 
[46] a Interest paid: 
[47] INTER<(CPAY+PER)-PRIN 
[48] aA Adjust interest to keep it from exceeding 
[49] A periodic payment: 
[50] INTER¢C+\INTER)JL+\CoINTER) oPAY+PER 
[51] INTER¢CINTER-~110,INTER 
[52] PRIN¢(CPAY+PER)-INTER 
[53] MAT¢PRIN,C1.5J]INTER 


Pe PN Pe PS Pe BA BN Ps NG ON me Os se Oe Pu NS A A Pu I 


PROBLEM: You have been presented an opportunity to invest $10,000 
of your money now in exchange for the promise of several 
future cash payments. The schedule of payments follows: 


May 12, 1987 You pay $10,000 
Jan. 1, 1989 You receive $2,000 
Jan. 1, 1990 $3,000 
Jan. 1, 1991 $4,000 
Jan. 1, 1992 $5,000 
May 12, 1992 $1,000 


Is this a good investment? Are you better off leaving your 
money in the bank at 10%? 


TOPIC: Internal Rate of Return 


One approach to this problem is to explore two scenarios. In the 
first scenario, you deposit $10,000 in your bank now (May 12, 1987) 
and leave it there. At the end of 5 years (May 12, 1992), at 10% 
effective interest, your money has compounded to $16,105.10 
($10,000x1.10%*5). In the second scenario, you deposit nothing now 
but instead deposit $2,000 on Jan. 1, 1989, $3,000 on Jan. 1, 1990, 
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and so on as in the schedule above. Compute the balance as of May 
125. 19925 


To compute the balance for the second scenario, you must convert the 
dates to more manageable measurements of time. By using the TODAYS 
function presented in the Manipulating Dates chapter, you can 
translate the dates into days from May 12, 1992: 


DAYS¢TODAYS 19890101 19900101 19910101 19920101 
19920512 
O¢DAYS¢DAYS([5]-DAYS 
1227 862 497 132 0 


We can translate these days into "years" by dividing by 365. Then, 
it is a simple matter to accumulate each amount (the annual 
accumulation factor is 1.10) for the corresponding number of years. 


2000 3000 4000 5000 1000+.x1.10*DAYS=365 
17242.31 


Since $17,242.31 is greater than $16,105.10, you will be better off 
to accept this investment opportunity than to leave your $10,000 in 
the bank. 


Will the conclusion be the same if the rate of interest offered by 
the bank is 12%? How about 15%? The higher the rate of interest, 
the less important are future cash flows and the more important are 
current flows. At what interest rate will the accumulated value of 
the investment cash flows be exactly the same as the accumulated 
value of your money left in the bank? That is, at what interest rate 
will you be indifferent Cignoring the riskiness of the investment) 
between making the investment and leaving your money in the bank? 
This rate of interest is called the "internal rate of return" (IROR) 
of the investment cash flows. If you can realize a higher interest 
rate than the IROR by an alternative investment (such as leaving your 
money in the bank), you should not invest. However, an investment is 
a good investment if the IROR of its cash flows is higher than any 
alternative investment. 


To determine the IROR of a set of cash flows, you must employ an 
iterative (trial and error) technique known as "successive 
approximations". Your task is to determine the interest rate (I) for 
which the following equation is true: 


CLOOOOxC1+I)*TO) = C2000xKC1+LII*AT1) + C3000xC1+I)*T2) + ... 


where TO, Tl, T2, ... are the numbers of years from the corresponding 
cash flow to the date of the last cash flow. 


You may try different interest rates and see which produces the best 
results. From these observations you can choose better interest 
rates and try again. By repeating this procedure, you will gradually 
narrow in on the correct IROR. 
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To speed up this process, you may apply a technique known as the 
Newton-Raphson Method which uses the results of the previous guess to 
make an intelligent next guess. The guesses converge to the correct 
result much faster than standard interpolation procedures because the 
method considers the derivative (slope) of the formula. 


The method is: 
NEXTI = fCLASTI)+f£' (LASTI) 
where: 
NEXTI: the next interest rate to try; 
LASTI: the previous interest rate tried; 
f(I): the formula as a function of the interest rate I; 
f'(I): the derivative with respect to I of the formula f(I). 


Given this method, we may write a function IROR which returns the 
internal rate of return for a specified set of cash flows. The left 
argument is a vector of the dates (YYYYMMDD) of the cash flows and 
the right argument is a vector of the amounts of the corresponding 
flows, where outflows are represented as negative numbers and inflows 
are represented as positive numbers (or vice versa). The solution to 
this problem is then: 


DATES€19870512 19890101 19900101 19910101 19920101 
19920512 
AMTS€~10000 2000 3000 4000 5000 1000 
DATES IROR AMTS 
0.12165 


From this, we conclude that the investment should be made if we have 
available no alternative investments which will generate more than 
12.1654. 


If an investment opportunity involves several outflows interspersed 
among the inflows, it is possible that the flows may not define a 
distinct internal rate of return. That is, several different 
interest rates may be used to accumulate (Cor discount) all the 
outflows to the same value as the inflows. Beware. 


The IROR function is listed below. Note that it expresses the 
formulas as present value formulas in terms of the annual discount 
factor V (i.e. the reciprocal of the accumulation factor) rather than 
in terms of the interest rate. Also note that the iterations stop 
once the discount factor is determined within .0000001 or when 10 
iterations have occurred. If the result is not determined in 10 
iterations, the result is set to 0. Modify the function if you 
desire greater accuracy or more iterations or if you wish to try a 
starting value other than 10%. 
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[WSID: INTEREST] 
V INT¢DATES TIROR AMTS;DAYS;DIFF;I1;TAMTS;TYRS;V;YRS 
Computes the internal rate of return, as an 
effective (annual) interest rate, for the stream 
of cash flows defined by the left argument vector 
of dates Cin YYYYMMDD format) and the 
corresponding right argument vector of amounts. 
Positive amounts are inflows and negative amounts 
are outflows. Requires subfunction: TODAYS. 


rc 

Oy 

Law 
DDDDDIDDIDO 


Translate dates into days since Feb. 29, 0000: 
C10] DAYS¢TODAYS DATES 

[11] A Translate days into years (365 days per year) 
[12] A since the day of the first cash flow: 

[13] YRS¢CDAYS-L/DAYS)+365 

[14] aA Precompute factors needed in formula below: 

C15] TAMTS¢AMTSXYRS 

C16J] TYRS¢€°1+YRS 

[17] aA Start with an effective interest rate of 10 pct.: 
[18] Verzl.l 

[19] A Number of iterations performed so far: 

[20] tI1¢€0 

[21] A Apply Newton-Raphson to get new V: 

[22] LOOP: V¢V-DIFF¢ CAMTS+. xV*YRS)+TAMTS+.xV*TYRS 

[23] A Exit if done (change less than .0000001): 

[24] 3(1E~72IDIFF) DONE 

[25] aA Branch to next iteration unless 10 itns. already: 
[26] 3(10>I«I+1) LOOP 

[27] A Else set discount factor to 1 if unknown in 10 itns.: 
[28] Vel 

[29] DONE: INT¢ 1++V 
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PROBLEM: Suppose you purchase a bond on February 17, 1987 for 
$4225. On April 1, 2003 (the maturity date), the bond will 
mature and you will receive $5000 (the par or redemption 
value). From the purchase date to the maturity date, you 
will receive semiannual interest payments (coupons) of #300 
on April 1 and October 1 of each year. (The annual coupon 
rate is 12%, i.e. .12=(2x$300)+%5000). In addition to the 
$4225 purchase price, you must pay the seller his portion 
of the upcoming (April 1) coupon. The computation of this 
payment (the accrued interest) is a proration based upon 
the number of days of the 6 month coupon period during 
which the bond was held by the seller. What is the 
internal rate of return of this investment? 


-296- 


Chapter 18 FINANCIAL UTILITIES 


TOPIC: Bond Calculations 


The internal rate of return of a bond is called its "yield to 
maturity". The IROR may be determined by the methods described in 
the previous section. That is, you may construct a vector of the 34 
dates on which cash flows occur (19870217 19870401 19871001 19880401 

20030401) and a corresponding vector of the amounts payable on 
those dates. You may then use the IROR function directly. 


However, since the cash flows of the bond consist of a simple annuity 
(the coupon payments) and a single maturity payment, a relatively 
simple formula exists to describe the present value of the bond's 
future cash flows. By taking the derivative of this forumla (tedious 
but not difficult) and applying the Newton-Raphson Method, the yield 
to maturity may be determined in a few iterations. 


Suppose you need to determine the yields to maturity for an entire 
portfolio of bonds (say 500 bonds). If you use the former approach 
(the IROR function), you will need to create the date and amount 
vectors individually for each bond (since they may be of different 
lengths and may involve different dates). If you use the second 
approach (the formula), you may perform the successive approximation 
process on all the bonds at once. After a few iterations, you will 
have the yields for all the bonds. 


The function FCYIELD C''fixed coupon yield"), listed below, uses this 
latter approach to compute the yield to maturity for one or more 
bonds. Once the yield is determined for a given bond, that bond is 
excluded from the successive approximation process. This function is 
a extremely efficient solution to a problem which is iterative by 
nature and is generally viewed as a "processing hog" when solved 
using APL. 


The result of FCYIELD is a "couponly" rate (e.g. a 6 month rate for 
semiannual coupons). To convert the result to an effective (Cannual) 
rate, use the EFFECTIVE function presented earlier in this chapter. 


One final note: this approach uses the formula for a regular 
annuity. However, because of leap years and months of irregular 
lengths, the coupon payments are not perfectly regular. The FCYIELD 
function (Cand the investment community in general) assumes that the 
year consists of 12 30-day months. Dates which have a day portion of 
31 (e.g. 5/31/87) are treated like the 30th day of the same month 
Ce.g. 5/30/87). Therefore, the results are not as precise as they 
would be if more care were taken when counting days. However, for 
most purposes the accuracy of the results is adequate. 
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CWSID: INTEREST] 


V YLD¢FCYIELD PARAMS ; COST; COUP; CRATE;CRY;DAYS;DIFF;F;F1; 


A 
A 


me) 
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F2;F3;1;IND;MCOST;MDATE;:MORE;MVAL;N;NCOUPS 5s NEW; OK; PAR; 
PDATE;RY;3W;Y 


Returns couponly yield rates for fixed-coupon 
securities defined in PARAMS, one row per security. 


PARAMS[ ;:1] 
[3:2] 


par value 

purchase cost excluding accrued 
interest (no coupon rec'd nor interest 
paid if purchased on a coupon date) 
purchase date (CYYYYMMDD) 

maturity date (YYYYMMDD) 

annual coupon rate 

number of coupons per year 

Coptional) maturity value (par value if 
omitted) 


C33] 
C34] 
[35] 
C;6] 
[37] 


PAR¢MVAL¢PARAMSE 311] 
COST¢PARAMSL 32] 

PDATE¢PARAMSE ; 3 ] 

MDATE¢PARAMS[ 341] 

CRATE*¢PARAMS[ ;5]+NCOUPS¢PARAMST ; 6] 
>(6=11oPARAMS ) oSTART 
MVAL¢PARAMSL ;7 ] 


Formula: 
COST = (€C€1+Y)*4-F)xCMVALX(1+Y)*-W)+PARXCRATEX1+ 
C1-C1+Y)*«-W)tY 
Where: Y = couponly yield rate 
F = the fraction (0<Fs1) of a coupon period 
from PDATE to the next coupon 
W = the number of whole coupon periods 
remaining from PDATE to MDATE (less 1 if 
purchased on a coupon date) 
COST = purchase cost including accrued interest 


Convert the formula to a function in Y by moving COST 
to the right side: 

£CY) = C€-COST)+(CC1+Y)*-F)x(CMVALX(1+Y)*-W)+PARx 
CRATEX1+(01-€1+Y)*-W)+Y 


The problem is to determine Y for which f(Y)=0. Solve 
by the method of successive approximations, using 
different values of Y. Start by trying Y=CRATE. Then 
use Newton-Raphson method to determine successive 
values of Y: 


YON+1) = YCONIJ-fFCYCN)I)+£' CYCN)) 
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V FCYIELD (continued) 
[50] a Determine f’(Y) by taking the derivative of f(Y). 
[51] a After tedious computations: 
[52] A 
[53] A f' CY) = CC1+Y¥)*4-F)xCCC14+Y)*7~1-W) x C (W+F) xC PAR 
[54] A CRATE+Y)-MVAL) + (CPARXCRATEX1+Y)+Y*2)- 
[55] a PARXCRATEX (CF+Y)++Yx*2 
[56] A 
[57] a Let us define the following: 
[58] A 
[59] A FI=1+Y F2=(C1+Y)*-F F3=(€C1+Y)*-W COUP=PARXCRATE 
[60] A N=F+W RY=<+Y CRY =COUP=+Y MCOST=-COST 
[61] A 
C62] A The formulas for f(CY) and f'CY) become: 
[63] aA 
[64] A FCY) = MCOST+F2x  CMVALXF3 )+COUP+CRYX1-F3 
[65] aA 
[66] A f'CY) = F2xCCF3+F1)xXCNxXCRY-MVAL)+F1xXCRYXRY ) 
C67] A ~CRYXF+RY 
C68] A 
[69] A Compute approx days (360 days/yr) from purchase to 
[70] A maturity (change 31 days to 30): 
(71] START: DAYS¢€ 360 30 1 +.x O 100 100 TMDATE-31=100IMDATE 
[72] DAYS¢DAYS- 360 30 1 +.x O 100 100 TPDATE-31=1001PDATE 
[73] A No. coupon periods from purchase to maturity: 
[74] N¢CDAYSXNCOUPS )+360 
[75] A Fractional and whole coupons from purch to matur: 
[76] FeN-WelN+°1 
C77] COUP¢+PARXCRATE 
[78] A Include accrued interest (prorated) in purch cost: 
[79] COST¢+COST+COUPXx1-F 
[C80] MCOST¢-COST 
[81] A Start with couponly rates from approximate yield 
[82] Aa formula: 
C83] YLD¢Y¢CMVAL+MCOST+NxXCOUP) + CMVALXN)+(CCN+1)xCOST-MVAL) +2 
[84] A Indices into YLD of yields not yet known: 
[85] IND¢ieYLD 
[86] A Number of next iteration: 
[87] TI¢l 
[88] A 
C89] LOOP: F1¢1+yY 
C90] F2¢F1*-F 
[91] F3¢F1*-W 
C92] CRY«COUPXRY¢+Y 
[93] A Apply Newton-Raphson to get new Y: 
[94] DIFF¢CMCOST+F2x (MVALXF3 )+COUP+CRYX1-F3)+F2x((F3+F1)x(N 
XxCRY-MVAL)+F1xCRYXRY)-CRYXF+RY 
[95] NEW¢Y-DIFF 
[96] A Flag those found (changed less than .0000001): 
[97] OK¢1E™721DIFF 
[98] A Compute indices of remaining elts: 
[99] MORE<(C~OK)/tp0OK 
[100] A Branch if no elts found: 
[101] 7C€CeO0K)=pMORE) opNEXT 
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V FCYIELD (continued) 
[102] a Insert found elements: 
C103] YLDCOK/IND]<OK/NEW 
[104] a Exit if no remaining elts: 
[105] ~»(€xXeMORE)JEND 
[106] A Else, squeeze down arrays: 
[107] MVAL¢MVAL[ MORE] 
[108] Fe¢FLMORE] 
C109] WeWCMORE] 
[110] N€¢NCMORE] 
[111] COoUP«¢COUPI[MORE ] 
[112] MCOST+MCOSTIMORE ] 
C113] IND¢IND[ MORE] 
[114] A Update current yields to latest values: 
[115] NEXT: Y¢NEWC MORE ] 
[116] A Branch to next iteration unless 10 itns. already: 
[117] 2(102I¢I+1)eLOOP 
[118] aA Else, set yield to 0 if unknown in 10 iterations: 
[119] YLDLINDI¢O 
[120] A 
[121] END: 
V 


We will use the two approaches to solve the problem stated at the 
beginning of this section. 


Approach 1: Using IROR 
(note: purchase price includes $226.67 accrued interest) 
DATES€19870217,(,€10000X1986+116)°.+401 1001) ,20030401 
AMTS¢€~ 4451.67,(320300) ,5300 
DATES IROR AMTS 
0.150274 

Approach 2: Using FCYIELD 

Y¢FCYIELD 1 605000 4225 19870217 20030401 .12 2 


2 EFFECTIVE 2xyY 
0.150305 


The slight difference between these two yield rates is the result of 
using an exact-days assumption (Cand 365 days per year) in the first 
approach, and a 30-days-per-month, regular annuity assumption in the 
second approach. 


Ba NG Bs Ns Be PR Pe Pro Pee Ps PS Aa A Pe Os Pe PRs Pe Ps Pre Oe Ps A A PN 


-300- 


Chapter 18 FINANCIAL UTILITIES 


PROBLEMS : (Solutions on pages 476 to 478) 


1. You deposit #1000 in an 18 month Certificate of Deposit at 11% 
compounded daily. What will be the value of your deposit when it 
matures? 


2. You make a deposit of $10 each week for 40 weeks into your bank's 
Christmas Club plan. Your bank pays 8%, converted monthly. How 
much will you have at the end of the 40 weeks? 


3. You borrow $10,000 from your bank to buy a car. The term of the 
loan is 4 years and the interest rate is 14%. What will your 
monthly payment be? If you decide to repay the loan after 3 
years, how much must you pay the bank (assuming no prepayment 
penalty)? How much interest will you have paid during the 3 
years? 


4. Your brother-in-law is opening a new restaurant and has 
approached you with the opportunity to invest in his venture. He 
is asking for an immediate outlay of $10,000 and a second outlay 
of $5,000 in 6 months. Starting 5 years from now, he will pay 
you $3,000 a year for 15 years. What is the internal rate of 
return of this investment (assuming all payments will be made as 
scheduled)? 


5. When the purchase price of a bond is less than its par (face) 
value, the bond is said to be selling at a "discount". When its 
price is greater than its par value, it is selling at a 
"premium". In general, a bond sells at a discount when interest 
rates are higher than the bond's coupon rate, and sells ata 
premium when interest rates are lower than the bond’s coupon 
rate. Fluctuating interest rates hence cause inverse 
fluctuations in the market values (purchase costs) of bonds. 
When interest rates are up, bond prices are down and vice versa. 
As the maturity date of a bond gets nearer, the fluctuations of 
its market value are less pronounced. On the maturity date, the 
market value of the bond is exactly equal to its par value, 
regardless of prevailing interest rates. 


When you buy a bond, its value on your books (its "book value") 
is the purchase cost. When the bond matures, its value on your 
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books is the par value. These two values are not the same when 
you buy the bond at a discount or a premium. Since the book 
value of the bond changes from the purchase date to the maturity 
date, and since you do not want the change to appear as an abrupt 
change at maturity, you must "amortize" the amount of the 
discount or premium over time, modifying the book value of the 
bond accordingly. 


In general, the book value of a bond on a given date is computed 
by using its yield-to-maturity and the present value formula in 
the FCYIELD function to determine the present value of the bond 
on the coupon dates before and after the given date and by 
interpolating between the two dates. 


Write a function FCBOOK (fixed coupon book value) which returns 
the book values of a specified portfolio of bonds as of a 
specified date. The left argument of FCBOOK is the scalar date 
CYYYYMMDD) as of when the book values are to be computed. The 
right argument is a matrix of bond parameters with one row per 
bond. The 6 columns contain, respectively, par value, maturity 
date (YYYYMMDD), annual coupon rate, number of coupons per year, 
couponly yield rate (as from FCYIELD), maturity value (par value 
if omitted). The result is a vector of book values with one 
element per bond (Crow of the right argument). 
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EXCEPTION HANDLING 


In this chapter we discuss the concepts of exception handling in 
APL, and illustrate exception handling techniques on some of the 
popular implementations of APL. 


An exception is an event which, if not handled, will cause a function 
to suspend. Specifically, it is an error or an attention (pressing 
the BREAK key). When an exception occurs and is not handled, 
diagnostic information displays and the user is left in immediate 
execution mode. In other words, the function being executed no 
longer has control of what will happen. For better or worse, the 
user must decide what action to take next. 


The concept of exception handling is that facilities are provided to 
enable the programmer to insert code into a system which will detect 
an exception when it occurs and will take some action other than 
Simple suspension. 


What kinds of action is the function (i.e. the programmer) likely to 
take when handling an exception? There are 4 typical choices: 


1. Do something and then return to immediate execution mode with 
the function suspended on the exception line. This is the 
default Cunhandled) behavior where the "something" is to 
display diagnostic information. 


2. Do something and then resume execution at the exception line. 
Hopefully, the "something" (e.g. allocating more disk storage) 
removes the cause of the exception so that the line will 
complete without exception this time. 


3. Branch to another line of the exception function where special 
logic has been included to evaluate the exception and to take 
appropriate action. After taking such action, the function may 
choose to branch back to the exception line or to branch 
elsewhere. 


4. Leave the exception function altogether by signalling an error 
Ci.e. exception) to the environment which called the exception 
function. This is the behavior taken by primitive APL 
functions. For example, if you attempt to divide by 0, the 
divide primitive (+) does not suspend within its assembler code 
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but rather signals an error (DOMAIN ERROR) to the function line 
on which divide was called: 


DOMAIN ERROR 
CALCLi5] A€BeC 
A 


Likewise, it may be desirable to have the exception function 
(e.g. SQRT) signal an error to its calling environment: 


DOMAIN ERROR 
CALC(L25] At¢SORT B 
A 


The calling environment may then handle or not handle the 
exception as appropriate. 


There are a number of different implementations of exception handling 
in APL. Each of these implementations takes a different approach to 
allow the 4 choices above. We will illustrate the 3 major exception 
handling implementations (APL*PLUS, SHARP APL, APL2) in this chapter. 
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PROBLEM: Given a character vector named INPUT which contains a 
user-entered APL expression, execute the expression and 
return its result. If the execution of the expression (or 
its assignment to the result variable R) generates an error, 
report the error and reprompt by branching to the line 
labeled PROMPT. 


TOPIC: Detecting the Error 


In APL*PLUS, the system variable OELX Cerror latent expression) may 
be assigned any executable character vector (as with OLX). The 
character vector is executed only if an error occurs. The niladic 
system function ODM (diagnostic message) returns a character vector 
representation (with embedded newline, i.e. carriage return, 
characters) of the diagnostic message of the most recent exception. 
The default setting of DELX in a clear workspace is ‘ODM’. 
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The solution to the problem using APL*PLUS: 


C0] cee 2UB GA 

C1] DELX<’ODM’' If an error occurs on lines 2 to 14, 
: display the diagnostic message and 

: suspend. 

[15] OELX¢’>ERR’ Branch to ERR if an error on line 16. 
[16] ®'Re' , INPUT 

C17] OELX¢’' ODM’ 

[25] ERR: O¢ODM Display diagnostic message. 

[26] >PROMPT Ask again. 


In SHARP APL, the system variable OTRAP may be assigned a delimited 
character vector (say, delimited by the 'V’ character) or a character 
matrix where each partition or row specifies the action to be taken 
for a given class of errors. For example, the expression 


OTRAP<’VO E >ERR' 


says to execute (E) the expression ~ERR if any (0) error occurs. 

The system variable OER (event report) contains a 3-row character 
matrix representation of the most recent exception. The first 5 
characters of the first row of DER contain the "event number" (e.g. 4 
for RANK ERROR). The default setting of OTRAP in a clear workspace 
is *'. 


The solution to the problem using SHARP APL: 


CO] --. ;OTRAP 
C1] DTRAP<’' Normal message and suspension if 
: error on lines 2 to 14. 
[15] OTRAP¢’'V O E ERR’ Branch to ERR if an error on 
[16] ’ Re’ , INPUT line 16. 


[17] OTRAP¢’’ 


[25] ERR: O0¢€5l0OER(OIO; }j Show error message but not event 
[26] O¢1 OJOER number. Show rest of diagnostic 
[27] PROMPT message. Ask again. 


In APL2, the dyadic system function OEA Cexecute alternate) takes an 
executable expression as its character vector right argument and 
executes it ala ®. If the expression can be executed without 
exception, the left argument of OEA is not considered. However, if 
the execution of the right argument of OEA generates an exception, 
the left argument is executed. The system variable DEM Cevent 
message) contains a 3-row character matrix representation of the 
diagnostic message of the most recent exception having occurred at 
the current level of the state indicator. 
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The solution to the problem using APL2: 


[16] '9ERR’ DEA ‘Re!’ , INPUT Branch to ERR if an error 
: during ¢. 

[25] ERR: O<OEM Display diagnostic message. 

[26] PROMPT Ask again. 


A different APL2 solution is possible by using the monadic system 
function DEC (execute controlled). OEC is like OEA in that its 
character vector right argument is an executable expression. 

However, the result of OEC is a 3 item nested array which contains 
information about the attempt to execute the right argument. The 
result can be evaluated to determine whether or not an error has 
occurred. fMEC distinguishes among the various types of executable 
expressions and so allows messages which are more specific than those 
possible from OFA. 


See an APL2 reference manual for more complete documentation on OEC. 
Here is the solution using DEC in APL2: 


[16] 2Z<¢DEC INPUT 
[17] 7ERR UNLESS Zf1l1jJ€1 2 Branch unless result or 
[18] R¢€3>Z assignment. 


[25] ERR:?CZ01)=0 3 4 5)/ERR1,ERR2,ERR3 ,ERR3 
[26] ERR1:0¢0EM 

C27] »PROMPT 

[28] ERR2:O0¢'’EXPRESSION HAS NO RESULT’ 

[29] »>PROMPT 

[30] ERR3:O0¢’ BRANCHING NOT ALLOWED’ 

C31] >PROMPT 
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PROBLEM: Suppose you have written a dyadic function STAT whose 
arguments are same-length numeric vectors and whose result 
is a numeric vector of various statistics. The function is 
self-contained, i.e. it requires no subfunctions or global 
variables and it assigns no global variables. Because of 
its syntax and self-containment, STAT behaves like a dyadic 
primitive function (Conce copied into the workspace) in all 
regards but one: error handling. When provided with faulty 
arguments, STAT suspends on one of its lines after 
displaying one of: RANK ERROR, LENGTH ERROR or DOMAIN 
ERROR. Rewrite STAT to signal these errors and others 
(say, WS FULL) to its calling environment rather than 
suspending. 
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TOPIC: 


EXCEPTION HANDLING 


Signalling the Error 


In APL*PLUS, the monadic system function OERROR takes a character 
vector error message argument and signals that message to the 
environment from which was called the function in which OERROR is 


executed. 


C1] 
C2] 


OOPS 


For example: 


V TEST N 
OERROR 'OOPS' 
V 

TEST 5 


TEST 5 
A 


Once the error occurs within STAT, the message may be found as the 


first line of the result of ODM. 


The first line is defined as all 


characters up to the first newline character (represented in APL*PLUS 


as OTCNL). 


The solution to the problem using APL*PLUS: 


[O] 
Ci] 


--- ;UELX 
OELX¢«’OERRORCA\ODM4OTCNL) /ODM’ 


In SHARP APL, the dyadic system function OSIGNAL takes an optional 
character vector error message left argument and a corresponding 
event number right argument and signals the message (and event number 
as the first 5 characters of DER) to the environment from which was 


called the function in which OSIGNAL is executed. 


C1] 
C2] 


OOPS 


The function OSIGNAL may also be used monadically. 
an integer event number in the range 1 to 999 (e.g. 


For example: 


V TEST N 

‘OOPS’ OSIGNAL 599 
V 

TEST 5 


TEST 5 
A 


Its argument is 
2 for the 


standard APL error message SYNTAX ERROR). 


The solution to the problem using SHARP APL: 


CO] 
C1] 


... sUTRAP 
OTRAP<'VO E OSIGNAL?5e0ER’ 


or, using DEC (environment condition): 


CO] 
C1] 


sa-0 FOEC 
DEC¢1 a Disallow suspension 
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In APL2, the monadic system function OES (event simulation) takes a 
character vector error message argument or a 2-element integer vector 
event type code (e.g. 1 3 for WS FULL) and signals the message to the 
environment from which was called the function in which OES is 
executed. For example: 


VY TEST N 
C1] OES 'OOPS'’ 
E24 V 

TEST 5 
OOPS 

TEST 5 


A 


The system variable DET Cevent type) contains the 2-element integer 
vector event type code of the most recent exception having occurred 
at the current level of the state indicator. 


The solution to the problem using APL2: 


C1] ‘HES DET’ OEA ‘line 1 of STAT’! 
C2] ‘OES OET’ OEA 'line 2 of STAT’ 


[3] 'OES DOET' OEA ‘line 3 of STAT’ 


s 
° 
s * 
° sd 


If self-containment were not an issue, the solution using APL2 would 
be to rename STAT to STAT1 and to write a new STAT function: 


V R¢A STAT B 
C1] ‘OES OFT’ UOEA 'R¢A STATI B’ 
V 
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PROBLEM: Given a 70-line function of critical code named PROCESS, 
incorporate exception handling such that an interrupt (the 
BREAK key) will cause the message PLEASE BE PATIENT to 
display and execution to resume. 


TOPIC: Detecting the Attention 


In APL*PLUS, the system variable OALX Cattention latent expression) 
may be assigned any executable character vector (as with OLX or 
OELX}. The character vector is executed only if an attention is 
Signalled (via the BREAK key). The default setting of OALX ina 
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clear workspace is 'DDM’. That is, the default behavior of the 
system is to display the diagnostic message generated by the 
attention and to suspend. 


The niladic system function OLC (line counter) returns an integer 
vector of the numbers of the suspended or pendent lines in the state 
indicator. The first number corresponds to the top (most recent) 
level in the state indicator and the last number corresponds to the 
bottom level. Since the branch primitive function () only considers 
the first element of its argument, the expression »®0LC will cause the 
flow of execution to proceed to the line on which the expression was 
executed. 


The solution to the problem using APL*PLUS (% is a statement 
separator): 


CO] see SAGX 
CiJ DALX¢'O¢’'’ PLEASE BE PATIENT’!’ O XOLC’' 


In SHARP APL, the event number 1000 represents any interrupt. The 
solution using SHARP APL: 


CO] ---;UTRAP 
C1] OUTRAP<€’V1000 E U«’’ PLEASE BE PATIENT’’ © 2%0LC’' 


In APL2, an attention is not considered an exception which can be 
handled. Therefore, this problem as stated cannot be solved using 
APL2. If the PLEASE BE PATIENT message is omitted from the problen, 
you may solve the problem in APL2 by "conditioning" the PROCESS 
function to not be interruptable: 


0 01 0 UFX OCR 'PROCESS' 
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PROBLEM: You wish to install a fully-tested production system such 
that if any unexpected (i.e. unhandled) error occurs, the 
message "AN UNEXPECTED ERROR HAS OCCURRED; CONTACT J. SMITH 
IMMEDIATELY" will display and the system will suspend on 
the exact line of the error. How will you modify the 
system to do so? 


TOPIC: Suspending the Function 


In APL*PLUS, the default behavior of the system when handling an 
exception is to suspend on the exception line. In fact, if DELX is 
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set to '’, that is all that will happen. Since this behavior would 
be confusing to the user (suddenly in immediate execution mode with 
no indication of an exception), the default setting of OELX is 
‘ODM’. With this setting, the diagnostic message is returned and 
displayed and then the function is suspended. The only way to not 
suspend the function is to branch Ce.g. OELX¢'*ENTER’) so that 
execution may resume or to signal an error (e.g. OELX¢'’OERROR 
''OOPS''') which may be handled by a more global OELX. 


The solution to the problem using APL*PLUS is to assign OELX globally: 
OELX¢'Oe’'AN UNEXPECTED...IMMEDIATELY''’ 


If an error occurs, OELX will be executed causing the message to 
display. Then execution will suspend. When Smith arrives, he will 
type ODM to display the diagnostic message and may resume execution 
by typing »0LC after correcting the error. 


If a function lacks exception handling but is a high security 
function, you may not want to allow the function to be left suspended 
after an exception. For example, a function which updates a payroll 
file may contain sensitive salary information in its local variables 
during execution. To prevent a function from suspending (i.e. to 
perform a >» when entering immediate execution mode) in APL*PLUS, 
localize the system variable OSA (stop action) and assign it the 
value 'EXIT’'. The default setting of OSA is '’. 


In SHARP APL, if an exception is not handled by the current setting 
of OTRAP, the diagnostic message will display and the function will 
suspend on the exception line. As with APL*PLUS, the only way to not 
suspend the function is to branch (e.g. OTRAP¢’VO E *ENTER’) so that 
execution may resume or to signal an error (e.g. OTRAP¢'’VO E '’OOPS'' 
OSIGNAL 599’) which may be handled by a more global OTRAP. 


The solution to the problem using SHARP APL is to assign OTRAP 
globally: 


OTRAP<'VO E O¢’'’AN UNEXPECTED...IMMEDIATELY'’ ’' 


If an error occurs, the message will display and execution will 
suspend. When Smith arrives, he will type DER to display the error 
report and may resume execution by typing »OLC after correcting the 
error. 


To prevent a high security function from suspending in SHARP APL, 
include the partition 'V2001 D EXIT’ in the current (local) 
definition of OTRAP. Event number 2001 represents the "immediate 
execution mode" event. D stands for "do". 


In APL2, if an exception occurs outside the execution of the right 
argument to OEA Cor OEC, a similar function), the diagnostic message 
will display and the function will suspend on the execution line. To 
keep the diagnostic message from displaying, each statement which may 
generate an error must be executed within the right argument of DEA, 
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or must be on a line of a function which is executed within the right 
argument of OEA, or must be on a line of a function called by a 
function called by QEA, and so on. 


Therefore, if COVERFN is the name of a function to which all of the 
other functions in the system are subfunctions, you may control the 
display of the diagnostic message by invoking the system with an 
expression such as: 


‘HANDLER’ OEA 'COVERFN’ 


Unfortunately, if an unhandled exception occurs anywhere within 
COVERFN, a suspension will not occur. Rather, the levels of the 
state indicator associated with COVERFN will be reset and HANDLER 
will be executed. Hence, we can choose either to detect the error 
but lose the suspension or to suspend by not detecting the error. 
For example, the expression 


'O<''AN UNEXPLAINED...IMMEDIATELY’ UEA 'COVERFN’ 


will cause the proper message to display. However, when Smith 
arrives, he will type DEM to display the event message and will find 
that it is empty and the state indicator is empty (unless an old 
suspension is lying around). 


To prevent a high security function from suspending in APL2, you may 
"condition" the function (say, PAYROLL) to not be suspendable: 


O10 0 Q@FX OCR ' PAYROLL’ 
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PROBLEM: Given a 20-line function MAINFN which calls a multitude of 
subfunctions, you would like to incorporate exception 
handling in MAINFN such that any error (even if ina 
subfunction with no exception handling) causes a branch to 
the line labeled ERRCODE in MAINFN and any attention causes 
a branch to the line labeled ATTNCODE. 


TOPIC: Controlling the State Indicator 


In APL*PLUS, there is no direct capability for solving this problem. 
Suppose MAINFN calls SETUP which calls READVARS which encounters an 
error on its 5th line. If there is no exception handling, the 
READVARS function will suspend and the state indicator will look 
something like: 
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ISI 
READVARS[5] * 
SETUPL19 ] 
MAINFN[ 4] 


A naive solution to the problem would be to include the following in 
MAINFEN: 


V MAINFN:....:0ALX;OELX 
C1] NALX<' »ATTNCODE' 
[2] DELX<'SERRCODE' 


[31] ERRCODE:etc. 
[41] ATTNCODE: etc. 


However, when the error in READVARS occurs, the value of OELX 
('»ERRCODE') will be executed, causing a branch to line 31 (the value 
of ERRCODE unless shadowed by a different ERRCODE local to SETUP or 
READVARS) of READVARS, not to line 31 of MAINFN. Somehow, we must 
get out of (i.e. »0) both READVARS and SETUP before branching to the 
line labeled ERRCODE. The only way to do this without landing in 
immediate execution mode (which would happen if you localized 
OSA¢'EXIT’ in both READVARS and SETUP) is to use OERROR. 


The approach we will take is this: set ODELX to check whether OELX 
has been localized at the current top level of the state indicator; 
if so (i.e. if in MAINFN), branch to ERRCODE; if not (i.e. if in 
SETUP or READVARS), use OERROR to reset the top level of the state 
indicator and to signal an error (say, the same error message) to the 
next level; since OERROR will trigger OELX, the process will be 
repeated until MAINFN is at the top of the state indicator. This 
process is called "propagating the error message". 


The monadic system function DIDLOC Cidentifier localization) returns 
a 1 row integer matrix of localization codes for the identifier whose 
name is provided as the character vector right argument. Each column 
of the result corresponds to one level of the state indicator (local 
to global) and the code “1 represents unlocalized. Therefore, if the 
result of 1lpNDIDLOC 'OELX’ is “1, then OELX is not localized at the 
top level of the state indicator. 


The initial solution to the problem using APL*PLUS: 


V MAINFN;... 50ELX 
C1] OELX¢’ eC 1=1e0NIDLOC’ 'OELX'')/' ’OERRORCA\ODMZ0OTCNL) / 


ODM’ 'O»ERRCODE’ 


This handles errors. What about attentions? We will use a similar 
approach: set OALX to check whether OALX has been localized at the 
current top level of the state indicator; if so (i.e. if in MAINFN), 
branch to ATTNCODE; if not (i.e. if in SETUP or READVARS), use OERROR 
to reset the top level of the state indicator and to signal an error 
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(say 'ATTN’) to the next level; OELX will be triggered at the next 
level and will behave as described above except it will branch to 
ATTNCODE rather than ERRCODE if the error message is ‘ATTN’. 


The final solution to the problem using APL*PLUS: 


V MAINFN;...;0ALX;0ELX 
C1] OALX¢'’eC~1=1eNIDLOC’ 'OALX'')/''OERROR'’!''ATTN!''''! 
OP ATTNCODE’ 
[2] DELX©«’ eC ~1=l1eNIDLOC’ 'OELX’’)/' 'OERRORCA\ODM#OTCNL) / 
ODM''O?C''ATTN' 'A.=4TODM)OERRCODE , ATTNCODE ' 


In SHARP APL, there is a direct capability for solving this problen. 

The action code C (cut) can be specified within the OTRAP definition 

to specify that the state indicator should be "cut back" (l.e. reset) 
to the level at which OTRAP is local. 


The solution to the problem using SHARP APL: 


V MAINFN;...;0TRAP 
Ci] OTRAP€'VO C ®ERRCODE V1000 C »ATTNCODE’ 


In APL2, the only way to solve the problem is to execute each of the 
20 lines of MAINFN as the right argument of OEA. If an exception 
occurs during the execution of the line, the state indicator is 
automatically "cut back" so that MAINFN is at the top level; then the 
left argument of OEA is executed. Since attentions cannot be 
detected as exceptions in APL2, no branch to ATTNCODE is possible. 


The solution to the problem using APL2: 


V MAINFN;...;ELX 
C1] ELX¢’>ERRCODE’ 
C2] ELX OEA 'line 1 of MAINFN’ 
[3] ELX OEA 'line 2 of MAINFN’ 
C4] ELX OEA ‘line 3 of MAINFN’ 


[26] ERRCODE:etc. 


The only problem with this solution is its brute force appearance. A 
typical APL2 solution to the problem involves restating the problen: 
rename MAINFN to be MAINFN1 and write a new MAINFN which will handle 
any exceptions occurring within MAINFN1. 


Here is the alternative APL2 solution: 


VY MAINFN 
C1] '>ERRCODE’ OEA ‘/MAINFN1’ 
[2] 20 


[3] ERRCODE:etc. 
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PROBLEMS: (Solutions on pages 479 to 482) 


1. The following function displays the records of a file. The user 
is asked for the number of the record at which displaying is to 
begin. Then the function reads and displays every record from 
the specified record to the end of the file. Incorporate 
exception handling so that the user may press the BREAK key to 
halt the display and to ask again for the number of the record at 
which displaying is to begin. Assume that the READ function uses 
exception handling such that an attention within READ causes READ 
to signal the error message 'ATTN’ to its calling environment. 


V SHOWFILE NAME;LIM;N 
C1] LIM¢TIEFILE NAME 
[2] ASK:O0¢’ BEGIN WITH WHICH RECORD?!’ 
3a N¢O 
[4] >(CO=N)/0 
[5] Loop:oe’’ 


C6] O<CéN),': ',READ N 

C7] Oe'!’ 

C8] > (CLIM=N¢N+1)/LOOP 
v 


2. Write a dyadic function NXPROMPTE which prompts for and returns a 
vector of numbers. Its right argument PROMPT is a character 
vector which is displayed to prompt the user for input. Input is 
accepted beyond the prompt on the same line. The left argument 
is the number of numbers required. If 0, the function will allow 
any number of numbers. If the word END is entered, the result is 
the scalar 1. Any primitive APL expression may be entered and 
will be executed. User defined functions or variables may not be 
included in the response. Note that this function is identical 
to the NPROMPTE function developed in the chapter, Writing 
User-Friendly Interactive Functions, except APL expressions are 
allowed. For example: 


Q¢0 NXPROMPT 'ENTER PROJECT NO.S: ' 
ENTER PROJECT NO.S: 10+18 

Q 
11 12 13 14 15 16 17 18 
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3. Write a dyadic function ERRATTN to initialize the exception 
handling facilities such that any subsequent error will be 
handled as directed by its arguments. If the left argument, ELX, 
is a character vector, it will be executed directly if any error 
should occur; if the left argument is a numeric line label, the 
state indicator will be reset (cut back) to the level at which 
ERRATTIN is called and a branch to the scalar will take place. 

The meaning of the right argument, ALX, is the same as that of 
the left argument but is for attention handling rather than error 
handling. If either argument is empty (e.g. ''), the response to 
the respective exception is to display the diagnostic message and 
suspend. For example: 


Ci] '' ERRATIN '!' 


[6] 'FIXFILE © 2%0LC’ ERRATIN L5 

[7] L4:APPEND DATA 

C8] >MORE 

[C9] L5:0¢€'FILE INCOMPLETE DUE TO INTERRUPT. ' 


On line 1, the error and attention handling is set to its default 
behavior (if an exception, display diagnostic message and 
suspend). On line 6, a character vector error handler and a 
numeric line label attention handler are provided. If an error 
occurs (say, within the APPEND function), the FIXFILE function is 
executed and execution is resumed. If an attention occurs, 
execution is resumed on the line labeled L5 (even if the error 
occurs within the APPEND function). 
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POSTSCRIPT 


The functions described and listed in this book are available on 
a floppy disk for the APL*PLUS PC system. To order one or more of 
these floppy disks, please send your check or money order to: 


ADVANCED APL FUNCTIONS 
ZARK INCORPORATED 

53 SHENIPSIT STREET 
VERNON, CT 06066 


Specify the number of disks desired and enclose $15 per disk. 
Postage and handling charges are included. Connecticut residents, 
please include sales tax. 


The following is a list of the workspaces and functions included on 
the disk. 


JWSID BOOLEAN 
pANDMAP PpANDRED PANDSCAN pEQMAP pEQSCAN pGEMAP pGTMAP 
pPLESCAN pLTSCAN PNEMAP PNESCAN pORMAP PORRED pORSCAN 
pPLUSRED 


JWSID CASHBAL 
CASH1 CASH2 CASH3 £CASH4 


JWSID CNFNS 
ASSIGN CNCAT CNEQ CNEST CNGRADEUP CNIDX 
CNIDXA CNIOTA CNLEN CNAM CNAV 


JWSID COMMENTS 
UNCOMMENT UNLAMP 


JWSID CRTIMING 
TIMER TIMEADEFINE 
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IWSID DATES 


FROMDAYS FROMDAYS360 FROMDAYSA FROMMDY 
FROMOTS FROMYD IPDATEMDY TODAYS 
TODAYSA TOMDY TOMDYA TOOTS 

JWSID ERROR 
ERRATTNP ERRATTNS 

IWSID FILEDOC 
FILEDOC 

JWSID FLF 
EMPLOYEES 

IWSID FNIDS 
IDENTIFY LOCALIZE OBFUSCATE RELABEL 
UNOBFUSCATE 

JWSID FNREP 
CRAVR VRACR 

JWSID FNSFILE 
DROPFN FNCREATE GETFN PUTFN 

JWSID FORMAT 
CENTER CJUST COLFMT COLUMNIZE DEB 
DTB HEADINGS LJUST RJUST ROWFMT 
TITLES 

IWSID INPUT 
CPROMPT CPROMPTE ESCAPE If LPROMPT 
MESSAGE NINPUT NINPUT2 NPROMPT NPROMPT2 
NPROMPTE2 NXPROMPTE PROPOSAL UNLESS 

JWSID INTEREST 
EFFECTIVE FCBOOK FCYIELD IROR NOMINAL 
VALUE 

JWSID LOOP 


LOOPI NEXTI 


ere UR 


POSTSCRIPT 


FROMMDYA 
TODAYS360 
TOYD 


UNDIAMOND 


DLB 
THORN 


LPROMPTE 
NPROMPTE 


SCHEDULE 


POSTSCRIPT 


JWSID MSF 
ADDEMP CATEMP CINPUT DELEMP EMPLOYEES IF 
LISTEMP MESSAGE NINPUT RCAT RESTART SELECT 
SQZEMP START UNLESS 


JWSID MULTI2 


ASSIGN CATREC CATRECWS COMPRESS DELREC 
EXECUTE FCREATE FERASE FOR FREAD 
FREPLACE FTIE FUNTIE INDEX INDEXA 
INDEXWS INDEXWSA INITFILE IOTA LOTARHO 
LAYERS NRECARECH SELECT SELECTWS SLASHTOTARHO 


JWSID MULTIFLO 


ASSIGN CATREC CATRECWS COMPRESS DELREC 
EMPLOYEES EXECUTE FOR INDEX INDEXA 
INDEXWS INDEXWSA INITFILE LOTA LOTARHO 
LAYERS SELECT SELECTWS SLASHIOTARHO 


JWSID MULTISA 


ASSIGN CATREC CATRECWS COMPRESS DELREC 
EXECUTE FOR INDEX INDEXA INDEXWS 
INDEXWSA INITFILE IOTA LOTARHO LAYERS 
SELECT SELECTWS SLASHIOTARHO 


JWSID NNFNS 
ASSIGN NNCAT NNCATSS NNCATSV NNCATVS NNCATVV NNEST 
NNIDX NNIDXA NNLEN NNSUMCOL 


JWSID PRTFILE 


PRINT 
JWSID QDOC 
QDoC 
JWSID REDUCE 
ANDRED MAXRED MINRED ORRED PLUSRED AAND AANDRED 
AANDWAY AMAX AMAXRED AMAXWAY AMIN AMINRED AMINWAY 
AOR AORRED AORWAY APLUS APLUSRED APLUSWAY 
JWSID SEARCH 
BY CMIOTA CMIOTA1lL CMIOTA2 DEB IOTA LIOTA 
LIOTA1 REPLACE UIOTA UIOTAI1 UNQCM UNQOCV UNQIO 
UNOI1 UNQNV ASS 
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JWSID SORT 
CGRADEUP CGRADEUP1 


JWSID TIMING 
COST TIMER 


A 


JWSID USEDBY 
USEDBY 


JWSID UTILITY 
MONIOTA REPL 


JWSID WP 
WRAP WRAPLP 

JWSID WSDOC 
WSDOC 


CGRADEUP2 UPPERCASE 


TIMEADEFINE 
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POSTSCRIPT 


TIMEADISPLAY TIMEARESET 


Chapter 1 Solutions 


LIMBERING UP 


J), AMOUNT[ AMOUNT16451¢«845 (if exactly one 
occurrence of 645) 
or 
AMOUNT CAMOUNT=645) /tLeAMOUNTI€845 Cif O or more 
occurrences) 
or 
C CAMOUNT=645) /AMOUNT)€845 CAPL2) 
25 A/ (CPREMS>100)APREMS<500 
a +/24=L0.5+WEIGHT 
4. x/ oMAT or eo, MAT 
5. ‘ANSWER IS ’',(CéANS),’' YEARS’ 
6. NAMES¢NAMES ,£1101!loNAMES ) TNAME Corigin 1) 


7. A. Since a scalar has no shape, its shape is an empty vector. 
Therefore, the result of 912 is an empty vector. When the right 
argument of branch (7) is an empty vector, no branch takes 
place. Control proceeds to the next statement. 
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8. R¢CCCoVIe1l O0)/V)+4+256xCCeVIEN 1)/V 
or 
R€2561L90(N, 2) 0V 
or 
REC (N,2)eV)+.x1 256 
or 
R¢+/(N, 2) eVXCoV)Jel1l 256 


9. The reduction of an empty vector returns the identity element for 
the dyadic function involved in the reduction. The identity 
element is that value which when supplied as one of the arguments 
of the dyadic function will return the other argument. For 
example, since O+5 is 5 and 1x5 is 5, the identity element for 
plus (+) is 0 and for times (x) is 1. The only argument to 
minimum (lL) which will always return the other argument is 
positive infinity. Since positive infinity cannot be represented 
as a number, APL returns the next best thing, the largest 
possible number which can be represented on the computer. This 
value varies from implementation to implementation but is 
generally some value like 1E77 or 2E300. For non-commutative 
functions (like +), the identity element is that value which when 
supplied as the right argument of the dyadic function will return 
the left argument. For example, since 5+1 is 5, the identity 
element for divide is 1. 


10. The running alternating sum. 


-\ 3 8 6 5 
3 "5 1 “4 (3),03+¢€-8)),03+(0-8)+6),03+0-8)+6+(-5)) 


11. Display the state indicator (via ")SI") to see where the 
suspension occurred and branch to the line number shown on the 
top of the state indicator. Alternately: 


>0LC 


12. Most implementations of APL insist that the header of a function 
not be changed once the function is called. It is too tricky to 
handle the problems which arise when you make a global variable 
local, or vice versa, while the function is suspended. Likewise, 
labels may not be added or deleted (or possibly moved to 
different lines) in a suspended function, since the values of 
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labels are assigned at the moment the function is called. Most 
implementations of APL will allow you to make such editing 
changes to a suspended function but will display a message like 
SI DAMAGED or SI ERROR or SI WARNING and will not allow you to 
resume execution at the point of the error. You will need to 
reset the state indicator and rerun the function from the 
beginning. 


13. The "rank" or "ranking" of VECTOR. For example, if 6 students 
have these test scores, 


SCORES¢87 99 83 85 65 100 
the ranks of the respective students are: 


445SCORES 
4523 1 6 


where the 1 corresponds to the student with the lowest score and 
the 6 corresponds to the student with the highest score. 


14.a. C~1leA),1lo0B 
b. ditto 

153s CeA),eB 
b. ditto 


16. OAI Caccounting information), a niladic system function. Usually 
the second element of the result of OAI is a measure of CPU time 
consumed since signon to APL. 


17. «PROMPT O<“PROMPT 
R¢ (Ce PROMPT) JO Cor: OARBOUT 10 in APL*PLUS) 
RO 
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18.a. (N,1)e’ ' 


b. (N-1) pUTCNL 
(N-1)e0AV[156+010] 
(N-1)o0TCL£1+0101 


LIMBERING UP 


Cor: (N,0)o'’ =in APL2) 


CAPL* PLUS ) 
(SHARP APL) 
CAPL2 ) 


Note: None of these three expressions works correctly if N=O. 
They generate a DOMAIN ERROR. 


Lo. OEX ONL 2 
OERASE OIDLIST 2 
6 OFD 1 UWS 2 


20.a. SAMODEL¢12 14 
12 14 OUSTOP ‘MODEL’ 


b. Localize T in INTERPOLATE. 
line 11 and before line 13. 


take place on line 12, 


CAPL*PLUS, SHARP APL, APL2) 
CAPL*PLUS ) 
(SHARP APL) 


(APL*PLUS, SHARP APL, APL2) 


CAPL*PLUS PC, APL*PLUS UNX) 


T was somehow reassigned after 
Since the reassignment does not 


it must take place within INTERPOLATE 


Cor a subfunction of INTERPOLATE). 


21. Including the header in the count of lines, 


1teODCR 'CALC’ 
1te2 OFD 'CALC’ 


~1++/O0TCNL=OVR ‘CALC’ 
~1++/0AVE156+0T0)]=1 OFD 


CAPL*PLUS, APL2) 
CSHARP APL) 
CAPL*PLUS ) 


’CALC' (SHARP APL) 


22. Because DIO=0 and some elements of V2 are not found in V1. 
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BRANCHING AND LOOPING 


1. The expression does not work correctly in index origin 0; it 
modifies the random link (ORL); and it labels the programmer as 
having brain damage. 


2s »>(xXN)O®ZERO, POSITIVE,NEGATIVE 
or 
+> (NEGATIVE, ZERO, POSITIVE) ([2+xN] 
or 
>9(°-1 0 1=xN)/NEGATIVE, ZERO, POSITIVE 
3. a. SUM¢€0 lor SUM€0O 
T€11 CMPS€¢8+3xt100 
LOOP: 9ENDLOOP IF I>308 Tel 
SUM¢SUM+READ I LAB¢(C100e LOOP) , ENDLOOP 
T¢1I+3 LOOP: SUM¢SUM+READ CMPS[T] 
»LOOP leI+1 
ENDLOOP: »*LAB[I] 
ENDLOOP: 
Cs SUM¢0 d. SUM¢€O 
I@ENDLOOP,100 11 3 »*LOOPI ENDLOOP,100 11 3 
SUM¢SUM+READ I SUM¢SUM+READ I 
oI 2NEXTI 
ENDLOOP: ENDLOOP: 
e. SUM<0 where: VY SUMUP CMP 
SUMUP”” 8+3x1100 C1] SUM¢SUM+READ CMP 
Vv 
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4. or0¢1 
RSCAN¢€(1+RATE) *LTERM 
OPRIN¢€RSCANXLOAN-+\ PMT+RSCAN 


5. After performing the transformations, the result is DEPOSIT. 
Undoing the transformations yields the following function: 


CWSID: CASHBAL] 

V BALANCE¢RATE CASH4 DEPOSIT;ACCUM 
[1] aA Returns stream of cash balances for deposits 
[2] aA DEPOSIT and corresponding rates RATE. 
[3] aA Performs: 
C4] A BALANCE[CITJ¢DEPOSIT(CIJ+BALANCE[I-11]XRATE[I-11]+1 
[5] ACCUM€Ox\1,01+RATE 
[6] BALANCE¢(+\DEPOSTTXxXACCUM)+ACCUM 

V 


Note to actuaries: The approach taken here is to compute the 
future value of each deposit as of the last period, subtotal the 
future values, and then discount each subtotal back to the 
deposit date. Alternately, the approach in CASH3 is to compute 
the present value of each deposit as of the start of the first 
period, subtotal the present values, and then accumulate each 
subtotal back to the deposit date. 


6. Notice that the WRAP function below will iterate only as many 
times as the number of lines generated by the longest sentence. 


[WSID: WP] 
V REWID WRAP CVEC;01I0;BL; BREAK; LAST; LEN ;MORE;NL;START; 
TCNL 
[1] aA Wraps text CVEC into lines of length WID 
[2] aA or less by inserting newline characters. 
[3] aA Origin 1: 
[4] O10¢1 
[5] Am Newline character: 
C6] TCNL¢UTCNL A APL*PLUS 
C7] aA TCNL€EOTCEL2] A APL2 
C8] aA TCNL¢OAVL157] A SHARP APL 
[9] aA Flag newline characters: 
C10] NL€CVEC=TCNL 
[11] A Index before start of each sentence: 
[12] START¢O,NL/teNL 
C13] aA Lengths of sentences (between newlines): 
[14] LEN¢€ §1+C1/START,1+o0CVEC)-START 


=—320> 


Chapter 2 Solutions BRANCHING AND LOOPING 


V WRAP (continued) 
C15] a Flag valid break points (blank followed by 
C16] A nonblank): 
C17] BL€eCVEC=!' ' 
[18] BREAK¢BL>10BL 
[19] aA Initialize result from argument: 
[20] R€CVEC 
[21] a Flag sentences still to be broken: 
[22] LOOP: MORE¢LEN>WID 
[23] A Select just those remaining: 
[24] LEN€¢MORE/LEN 
[25] a Exit if none left: 
[26] »CO=pLEN)/0 
C27] START¢MORE/START 
[28] A Find last break point within WID chars of line: 
[29] LAST¢+/Vv\BREAK(START°.+®iLWID] 
[30] A Advance start to new break point: 
C31] START¢START+LAST 
[32] a Insert newlines: 
[33] RC(STARTI]¢TCNL 
C34] A Decrement remaining lengths: 
C35] LEN¢LEN-LAST 
[36] A Repeat: 
[37] LOOP 

V 
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C1] 
[2] 
[3] 
[4] 
[5] 
[6] 
C7] 
C8] 
[9] 


Vv 
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CWSID: TIMING] 
COST ;T 


A Displays dollars consumed since COST was last 
A run (Cas recorded in global AAI) and since 
A signon, assuming 75 cents per unit of DAI(2]. 
A Record time consumed so far: 


T¢+OAIC1+0OI0] 


A Check for global AAI and branch unless found: 


>9C(xXONC ‘'AAT')IL1 


A Display consumption since last use of COST: 


C10 2 60.75xT-AAI),’ DOLLARS CONSUMED’ 


C10] aA Display consumption since signon: 
C11) L1:010 2 60.75xT),' DOLLARS SINCE SIGNON’ 
[12] A Reassign global AAT: 


C13] 


C1] 
C2] 
[3] 
[4] 
C5] 
C6] 
C7] 
[8] 
[9] 
[10] 


V 


D>DDDODDOD 


AAT¢T 


CWSID: CRTIMING] 


Times the execution of the character vector ACA 


by running it ANA times. Returns a numeric 
of the average CPU time consumed per run. 


Prepare to build local functions... 

No. of columns in canonical representation: 
ABAC2414+0,ACA 
AAAC(7,ABAJe’ ! 
AAALOIO+1; J€ABAT’ AEACOAIL1+0101' 

AAALOIO+23; 1¢ABAT’! ATA€O' 
AAALOIO+4;1¢ABAT'ADA:',ACA ——™ 
AAATOIO+5; 1<¢ABAt’>AaLa' 
AAACOUIO+6; JEABAT’ AZA?: AEACUAIT1+01O]-AEA’ 


seat A 


scalar 
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V TIMER (continued) 
[17] A Define local fn AFA to run ACA: 
£18] ARACUFX AAA 
[19] A 
[20] aA Define local fn AGA to run nothing: 
[21] AAACOIO;5+0I0OJ€'G' 
[22] AAACOIO+43;1¢€ABAT’' ADA: ' 
[23] ARACOFX AAA 
[24] A 
[25] A Run the functions (disallow negative result): 
[26] ARACOl (AFA ANA)-AGA ANA 
[27] A Return the average: 
[28] ARACARA+ANA 


V T¢TRY SIZES;SL3R;3I 
[1] a Use as: TRY 50 100 for 50 row left arg, 100 right 
C2] L¢SIZES(1] 
C3] R¢SIZES[2] 
[4] Le(L,12)eDAVL?(Lx12)0256] 
C5] R¢L{ ?ReleeLI 
[6] aA Run it 5 times: 
C7] T¢5 TIMER 'I¢L CMIOTA R' 
V 


Define the numbers of rows for the left arguments: 
L€¢5/10 50 100 500 1000 
and for the right arguments: 
R¢25e10 50 100 500 1000 
Then time them all: 
V T¢L DOIT R 
Cij T¢CoL)e0 
C2] I¢0 
C3] LOOP: >(CeL)<I¢I+1)0e0 
[4] TCIJ¢TRY LII],REII 
[5] +LOOP 
V 


Ti¢L DOIT R 
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Change CMIOTA as instructed (to activate the looping algorithm) 
and run DOIT again: 

V CMIOTA 
Cedit it) 

V 

T2¢+L DOIT R 


The variables L, R, T1 and T2 will be needed in a problem at the 
end of the Curve Fitting chapter. Record and save them: 


JSAVE CMTIMES 
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POSITIONING CHARACTER DATA 


R¢CA\TEXTANL) / TEXT 
or 
R¢( CTEXTUNL)-OIO) o TEXT 
or 
R¢C+/A\TEXT#NL) op TEXT 
R¢CCODE='/'JA'L'=1LILCODE,’*' 
(What is wrong with: R¢ (CODE='/')A'UL' =1OCODE ?) 
(Try it on: CODE¢’LI€¢B/teB A USES /’ ) 
R€CODE OSS '/vu' CAPL* PLUS ) 
Re’ /iL' €CODE CAPL2 ) 


CWSID: FORMAT] 

V R¢W CENTER C 
C1] a Pads character vector C to width W, centering 
[2] aA it within that width. 
C3] ReWTC(CLCOMW-e,C)+2)0’ '),Cc 
C4] aA Alternative: 
[5] A REWt(-L(Wte,C)+2)TC 

Vv 
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C1] 
C2] 
C3] 
[4] 
C5] 
C6] 
[7] 
C8] 
C9] 
[10] 


C1] 
C2] 
C3] 
C4] 
[5] 
C6] 
C7] 
C8] 
[9] 
[10] 
C11] 
[12] 
C13] 
C14] 
[15] 
C16] 
C17] 
C18] 
[19] 
[20] 


C21] 
[22] 
[23] 
[24] 
[25] 


POSITIONING CHARACTER DATA 


[WSID: FORMAT] 


V R€D ROWFMT N 


V 


Formats a numeric matrix N into a character matrix. 
Dis an integer vector with one element per row 
of N. The integers indicate the number of decimal 
places, for each numeric row, to be displayed in 
the character matrix result. Each number is 
formatted in a width of <width> characters (e.g. 10), 
where <width> is an integer scalar global variable. 
Requires subfunction: COLFMT 
R¢D COLFMT&N 
R¢(C1l,width)xpNJep 2 1 3 &C(%eN) ,width)oR 
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CWSID: FORMAT] 


V R€RC COLUMNIZE CMAT;C;COLS ;PGS;ROWS ;0I0 


V 


Restructures skinny character matrix CMAT into 
a fat one. RC is scalar or 1 or 2 element 
vector. Last element is no. of "columns" of 
CMAT running down each page of the result. 

If RC has 1 element, result is a short 

(f C1TeCMAT)+RC rows), fat CRCX11oCMAT columns) 
matrix. If RC has 2 elements, first element is 
number of rows per page. Result isa 3 
dimensional character array with 

CF CLT PpCMATI=+x/RC) planes, RCL[1] rows and 
CRCX1LoeCMAT) columns. 
OLO¢+1 
RC¢,RC 
COLS€ 1TRC 
Ce1loCMAT 
a Branch if 3 dimensional result: 

>(2=oRC)oeLI1 
A 2 dimensional result: 

ROWS¢l (1toCMAT)+COLS 

R€CROWS ,COLSxC)o 2 1 3 &CCOLS, ROWS ,C)eCCCOLSxROWS ) ,C)T 
CMAT 

30 
A 3 dimensional result: 
L1: ROWS¢€1TRC 
PGS¢f C1TeCMAT)+ROWSXCOLS 
R¢(CPGS ,ROWS,COLSxC)o 13 2 4 &C(PGS,COLS,ROWS,C)oe(CPGS*x 
COLS XROWS ) ,C) TCMAT 


>DDDDIDIDDIDDIOO D 
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[WSID: FORMAT] 

V REWID HEADINGS CS;0I0;A;ARGSTART;B; BHDG; BSEG;HDGLEAD; 
LEN ;NCOLS ;NHDG;NROWS sNSEG; RESSTART 3S ;SEGLEAD; SEGROWS ; T 
Creates column headings from text CS within 

field widths WID. CS is a character vector with 
text for successive headings separated by 'n’. 

The format of WID is: (widths of headings, not 
including spacing),(spacing between columns). 

If WID has fewer elements than CS has segments, 

its values are repeated; if it does not include 
spacing specification, 2 is used. Empty headings 
are not underlined. Separate lines of multi-line 
heading by '¢’. 


fi] 
C2] 
C3] 
[4] 
[5] 
f6] 
C7] 
C8] 
[9] 
[10] 
C11] 
[12] OToO¢o 

C13] A 1s for hdg starts: 

C14] BHDG¢ecs='n' 

[15] aA 1s for segment starts: 

[16] BSEG¢CSe'’ne’ 

C17] A 

[18] A Flag end of hdgs (1 elt per segment): 

[19] BHDG¢10BSEG/ BHDG 

[20] A No. segments per hdg: 

[21] T¢BHDG/teBHDG 

[22] NHDG¢eNSEG¢T-"1l°1,T 

[23] A Segment lengths: 

[24] T¢BSEG/lipBSEG 

[25] LEN€(11T,o0BSEG)-T+1 

[26] A Spacing between hdgs (2s if omitted): 

C27] S¢NHDGLWID 

[28] S€NHDGeS,(0=eS)e2 

[29] A Reshape widths to conform with headings: 

[30] WID«¢NHDGoWID 

[31] A Truncated segment lengths: 

[32] LEN¢LENLNSEG/WID 

[33] A Leading blanks per segment, to center: 

[34] SEGLEAD¢f ( (NSEG/WID) -LEN)+2 

C35] A Col in which each hdg begins: 

[36] HDGLEAD¢€7110,+\WID+S 

[37] A Index of char. following delim. of each segment: 
[38] ARGSTART¢1+BSEG/ Lp BSEG 

[39] A No. of rows and columns in result (A is no. rows 
[40] A without underlines): 

C41] NROWS¢1+A¢€l /NSEG 

[42] NCOLS¢(C+/ CNHDG+~ 1)0S)++/WID 

[43] A No. whole rows before each segment: 

[44] Te¢NSEG/A-+\NSEG 

[45] SEGROWS¢T+lLoeT 

[46] A Index in raveled result where each segment starts: 
[47] RESSTART¢SEGLEAD+ (NSEG/HDGLEAD) +SEGROWSXNCOLS 

C48] A Create blank, raveled result: 

C49} R¢€CBENROWSXNCOLS) 0!’ ' 


D>DDDOIDIIVIOODO 
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V HEADINGS (continued) 
[50] a Flag nonempty headings: 
[51] T¢NHDGoed 
C52] TLCxXLEN) /NSEG/UNHDG]¢€1 
[53] A Indices of underlines in raveled result: 
[54] A€¢T/WID 
[55] A€A/(T/+\"1/C0B-NCOLS) ,WID+S)-"“11l0,+\A 
[56] A+A+loA 
[57] A Insert underlines: 
[58] RCA]¢'’-' 
[59] A TeMONIOTA LEN: 
[60] T¢T+LoT¢LEN/-~110,+\LEN 
[61] RLCT+LEN/RESSTARTI¢CS(CT+LEN/ARGSTART] 
[62] A Reshape result to matrix: 
C63] R¢C(NROWS ,NCOLS) oR 
V 
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SORTING AND SEARCHING 


1. If your APL implementation supports matrix right arguments to 4A: 
SPNUM¢PNUM[ A4APNUM; ] 
Otherwise (major-to-minor sorting done in minor-to-major order): 


G¢4PNUML 53] 

G¢GLAPNUM[G;2]] 
G¢GCL4APNUM[G;11]] 
SPNUM¢PNUM[G; ] 


Or pack and sort: 
SPNUM¢PNUM([41E3 1E7 1E41&PNOM; ] 


Note: Numbers are packed to 14 digit floating point numbers 
(the 1E4 assumes 4 digit extensions). Since the 14 digits 
do not exceed the 16 or 17 digits of available precision 
and since 4 works with the full precision (i.e. does not 
refer to OCT), the result will be accurate. 


2. Since dyadic . is dependent upon OCT (for floating point 
arguments), it will "find" matches which are very close but not 
exact. The IOTA function will overlook such close values and 
will "find" only those values which match exactly (to 16 or 17 
digits of precision). This will usually not be a problem since 
IOTA will most often be used on integer arguments or on floating 
point arguments whose values have not been computed. Only 
through such computations will "equal" values become slightly 
different. 


-334- 


C1] 

[2] 

[3] 

[4] 

[5:1 

[6] 

C7] 

[8] 

[9] 

[10] 
C11] 
C12] 
[13] 
[14] 
C15] 
C16] 
i ag 
[18] 
C19] 
[20] 
[24] 
[22] 
C23] 
C24] 
[25] 
[26] 
[27] 
[28] 
[29] 
C30] 
[31] 
[32] 
[33] 
C34] 
[35] 
C36] 
C37] 
[38] 
C394 
C40] 
C41] 
[42] 
[43] 
[44] 
[45] 
C46] 
[47] 
C48] 
[49] 
[50] 
[51] 


Chapter 5 Solutions 


V 


V 


[CWSID: SEARCH] 

INDS<BASE IOTA VALS;SA;F;G3;I;L 

RA Returns the indices of BASE at which the 

A elements of VALS first match, l.e. 

A INDS¢BASELVALS (but maybe faster) 

A Branch if right arg a vector: 

>C1l=eeVALS )eL1 

A Handle scalar right arg: 

INDS¢BASEtUVALS 

70 
L1:L¢(pBASE)(OIO} 

A¢( pVALS)([OIO] 

Aa Branch unless no elts in either arg: 
>(xF¢ALL)oL2 

A Handle empty arg: 

INDS<ApDIO 

>0 

Aa Branch if both args have more than 1 elt: 
L2:>CF#1L)eL4 

A Branch unless left arg has 1 elt: 

9(L41)eL3 

A Handle 1 elt left arg: 

INDS<€UI0+VALS# BASE 

>0 

Aa Handle 1 elt right arg: 
L3: INDS<BASELUVALS 

>0 

a Branch if sort alg. 
A (remove A after replacing C1,C2,C3,C4 by 
A computed constants): 
L4: Aa>0C0C44+C5XL+A)>C1+AXC2+C3XL) op L5 

A Combine args. 
G¢AACBASE, VALS 
AcA[G] 


Aa Flag lst of distinct elts by shifting and comparing: 


F¢A# 10A 

A Insure 1st elt is 1 Cin case all rows the 
FLOTOJ€1 

Q Indices of ist distinct elts: 

I¢F/G 

A Replicate for each like elt: 
FLEOIOJ<0I0 

T¢€1l+\F] 

A Unsort indices (to catenated order): 
INDS<¢I 

INDS(CGIJ€I 

A Keep those corresponding to right arg: 
INDS¢€LLINDS 


a Set 'not found’ inds to ‘one greater’: 
INDS¢€INDSLL+OI0 
20 


a Use looping algorithm if more efficient: 
L5: INDS©€BASEtUVALS 


5597 


SORTING AND SEARCHING 


costs more than looping alg.: 


and sort (like values together): 


same): 
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Ba I¢CPNUMA.=P)11 
or 
T¢C1E3 1E7 1E418PNUM)L1E3 1E7 1E4iP 


4. LOWER€1000 10000 20000 50000 70000 100000 
R€¢Ci 25 3 5 4 5)JCLOWER LIOTA SALARY] 


5% 
CWSID: SEARCH] 
V R¢DEB CVEC 
[1] aA Deletes extraneous (leading, trailing or 
C2] aA redundant) blanks from its argument and 
[3] aA returns the compressed result. 
[4] a Put extra blank on beginning and end: 
C5] CVECe” -" ;CVEC,”" 7% 
[6] a Search for 2 contiguous blanks: 
C7] Re 1l1lC~CVEC ASS ! ')/CVEC 
V 
6. O+"1lCCéNVEC),’ '3 REPLACE '"1 ' BY 'N/A ' 
fe O+€v/0 “24 CeENAMES )o0(C,UPPERCASE ENAMES)ASS 'SON’' )#ENAMES 


(Note: the last 2 Boolean columns are dropped to avoid 
coincidental wrap-around matches, e.g. if row 5 ends with 
‘SO’ and row 6 starts with 'N’') 

In APL2: 


OH«+Cv/'SON'’ €UPPERCASE ENAMES) #ENAMES 
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SELECTING 


1. (CoeVJel 0)/V 
Or 
VLE (-0I10)+2xiCeV)+2] 
or 
(CC CeV)+2),2)eV)IL30I01 
a (2e0I0) eM 
or 
(,CoM)o(C1+1lpepM)T1)/,M 
or 


(( (-010)+t1leeM)OM)C;O0I0] 


3. The expressions (°-1TV) or V[eV] return one element vectors, not 
scalars. Therefore, the result of M[;"1TV] is a one column 
matrix, not a vector. The following produce correct results: 


wML371TV] 
or 

,ML: VE CeV)-~0I01] 
or 

ML:(10)e71TVI 
or 


ML ;(10)poV] 


4. Approach 1: 


SHAPE¢pNAMES 

NAMES¢ , NAMES 

NAMES[ (NAMES='/')/toNAMES]<’, ' 
NAMES<SHAPEpNAMES 


IO 7 = 


Chapter 6 Solutions SELECTING 


Approach 2: 


A1¢A2¢(OAVENAMES ) /DAV 
A1[(Al1='/')/UpAlI€',? 
NAMES¢A1(A2UNAMES] 


Approach 3 (APL2): 


(C’/'=,NAMES)/,NAMES)¢€’ ,’ 


>. A. OLO¢O0 
MRATES¢(C , RATES ) (DUR+16XIAGE+100XSEX] 


Bs OHTO<0 
ADUR€15LDUR 
ATAGE¢IAGE+DUR-ADUR 
MRATES¢€( , RATES ) [ADUR+16XAIAGE+100xXSEX] 


Cy OIO<0 
VRATES¢, RATES 
VRATES [NEWDUR+16xNEWIAGE+100XNEWSEX ]©«NEWRATES 
RATES€( opRATES ) pVRATES 


[WSID: SEARCH] 
V R¢EUNQNV NV; FIRST:G;:SORTED 
[1] aA Returns the distinct elements of the 
[2] a numeric vector NV. 
[3] SORTED¢NVIGEANV] 
[4] FIRST¢+SORTED# 1%0SORTED 
[5] a Set 1st elt to 1 in case FIRST all Os: 
C6] FIRST(EUXoFIRSTI€1 
C7] R¢FIRST/SORTED 
[8] aA FIRST(LLUXpFIRSTI<OIO 
C9] aA ind¢G 
C10] aA ind(GJ]¢+\FIRST 
V 


CWSID: SEARCH] 
V R¢eUNOCV CV 
[11 aA Returns the distinct elements of the 
[2] a character vector CV. 
[3] Re (COAVECV) /OAV 
[4] aA ind€Rticv 
V 
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[WSID: SEARCH] 
V R¢EUNOCM CM;FIRST;G;SORTED 
[1] a Returns the distinct rows of the character 
[2] aA matrix CM. 
C3] G¢0AVACM 
[4] a If dyadic 4 unavailable: 
C5] A G¢OAV CGRADEUP CM 
C6] SORTED¢CMI[G; ] 
C7] FIRST€¢V/SORTED# LESORTED 
[8] a Set 1st elt to 1 in case FIRST all Os: 
C9] FIRSTCXtLeFIRSTI¢1 
C10] R¢FIRSTZSORTED 
C11] A FIRST(UXeFIRST]¢OI0 
[12] A ind¢G 
[13] A ind[G]¢+\FIRST 
V 


[WSID: SEARCH] 
V REN UNOII1 IV;3;BIT;ULO 
[1] aA Returns the distinct elements of the 
[2] aA origin 1 index vector IV. All elements 
[3] A of IV must be elements of UN. 
{4] ODI0¢1 
C5] BIT+Noe0 
[6] BITCIVI¢1 
[C7] R¢BIT/iN 
[8] aA ind¢+C(BIT\lLeR)ICIVI 
[9] aA Alternative: 
[101] A ind¢(+\BIT) CIV] 
V 


CWSID: SEARCH] 
V REN UNOIO IV3;BIT;ULO 
[1] aA Returns the distinct elements of the 
C2] a origin O index vector IV. All elements 
[3] aA of IV must be elements of UN. 
[4] OITO<0 
[5] BIT€No0 
C6] BITCIVIJ€1 
[7] R¢BIT/UN 
[8] a ind¢(BIT\lLeR)CIVI 
[9] a Alternative: 
[10] A ind¢(+\BIT)CIVI-1 
V 
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or 


or 


or 


or 


or 


or 


R¢+/’' ECMPH’o.=TZONE 


I11<’ECMPH’ tTZONE Cutility function) 
R¢5 11 PLUSRED 1 


R¢(53;'ECMPH' uTZONE)+/1 Chypothetical) 


R«('’ ECMPH' oc. =TZONE)+.xXSALES 


I11¢'ECMPH’' tTZONE Cutility function) 
R¢5 11 PLUSRED SALES 


R¢(5;'ECMPH' UTZONE) +/SALES (hypothetical) 


A¢'ECMPH’ co. =TZONE 
B¢eTYPE°.='’BCPS’ 
FRQ¢A+.AB 

AMT¢ (Ax (eA) eSALES)+. xB 
MAX¢(Ax(C eA) eSALES)I.xB 


I11¢’ ECMPH’ uTZONE Cutility functions) 
IT1i2¢’BCPS'.TYPE 

FRQ¢5 4 11 12 PLUSRED 1 

AMT¢5 4 11 12 PLUSRED SALES 

MAX¢5 4 11 12 MAXRED SALES 


IL11¢'’ ECMPH’ tTZONE (hypothetical) 
T12¢'’BCPS’tiTYPE 

FRQCC5 43;1113;112)+/1 

AMT¢(5 4;111;112)+/SALES 

MAX¢(5 4;2113;112)//SALES 
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[2] 

[3] 
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FREQUENCY COUNTS, ACCUMULATIONS AND CROSS-TABS 


[WSID: REDUCE] 


VY R¢L PLUSRED ARRAY ;CIND; CUM; DIM; DSHAPE;GRADE;1I;LAST;M;N 


;RANK;RRI5;SORTED;URRI 

A No. of ways for N-way reduction: 
NeL(o,L)+2 

Aa Which dimension to be "reduced"? 
DIM¢1T( (N+N)1L), ~LTLRANK¢p pARRAY 

a Branch unless O-way reduction: 
9(xNJoeLl 
R¢+/(CDIMJARRAY 
>0 

A Separate left arg into its pieces: 

L1: DSHAPECNoL 
CIND¢NoNIL 

Aa Begin to compute "raveled result indices": 

I<ULO 

A Index from tDSHAPEL[I] to cause index error if 

A invalid indices: 

RRI€CtLDSHAPE(CITIJ){¢'I'’ ,sCIND(1I}] 

A Branch if origin is 1: 
>0IOeLOOP1 

A Continue computing RRI (N iterations for N-way 

A reduction): 

LOOPO: 3(NsI¢I+1) 0ENDLP 
RRI€CUDSHAPEC TI) CC ¢'I’,és¢CIND[I])]1+DSHAPE[LI]xRRI 
>LOOPO 

LOOP1:39(N<I¢1I+1) oENDLP 
RRI€CUDSHAPECII)C Ce'I’ ,sCIND[LI1)1+DSHAPE[I]xRRI+71 
>LOOP1 

A Determine unique elements of RRI: 

ENDLP: GRADE¢4RRI 
SORTED¢RRI TC GRADE] 

LAST¢SORTED# 19SORTED 
LASTE (xe LAST) e( oe LAST) -~OI0]¢1 
URRI¢LAST/SORTED 

A Branch unless ARRAY a scalar (i.e. 
+(xRANK) 0 L3 

A Perform partitioned frequency count: 
CUM¢LAST/ tLe LAST 
CUM¢CUM- (oe CUM) 9 ( ~1+0I0) ,cUM 

Aa Branch if ARRAY is 1 Cusually): 
+(1=ARRAY)pL2 

A Multiply freq. 
CUM<ARRAY xCUM 

A Initialize result with 0's: 

L2:R¢(xX/DSHAPE) 00 

A Insert result of partitioned freg. count: 
RCURRI J¢CUM 

A Reshape to desired shape: 

R¢DSHAPEpR 
>0 


freq. count): 


counts by scalar: 


=J4aL= 
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V PLUSRED (continued) 
[49] A Reorder ARRAY to conform with SORTED: 
[50] L3:M¢DIM-OIO 
[51] ARRAY¢¢?’'ARRAYE’,(Mo’3;'),'GRADE’ ,(CRANK-Mti)0e’3'),'1' 
[52] aA Perform partitioned reduction: 
[53] CUM¢LAST/{CDIM]+\CDIMJARRAY 
[54] CUM¢€CUM-CeCUM)TO, [DIM1ICUM 
[55] a Initialize result. Fill with identity elt. 
[56] A Ravel the DSHAPE dim.s: 
[571 R¢C€C-M)%(x/DSHAPE) ,1/M®oeARRAY ) 00 
[58] a Insert result of partitioned reduction: 
[59] ¢’RE',(Mo’3;’),'URRI’,(CRANK-Mt+1)0'3'),'1€CUM' 
[60] A Reshape to desired shape (unravel DSHAPE dim.s): 
[61] R¢€C-M)ODSHAPE,1J/M®oARRAY) oR 
V 


CWSID: REDUCE] 
V R¢eL MAXRED ARRAY;CIND;DIF;DIM;DSHAPE;GRADE;1I;LAST;M;N; 
RANK;RRI;SORTED;URRI 
C1] aA No. of ways for N-way reduction: 
[2] Nel(p,L)+2 
[3] a Which dimension to be "reduced''? 
[4] DIMe1tCC(N+N) JL), ~1TURANK¢p epARRAY 
C5] a Branch unless O-way reduction: 
[6] %CxN)oeL1 
C7] R¢f /CDIMJARRAY 
C8] >0 
[9] aA Separate left arg into its pieces: 
[10] L1:DSHAPE¢NoL 
[11] CIND¢NoNIL 
[12] A Begin to compute "raveled result indices": 
[13] I<¢0I0 
[14] A Index from tDSHAPE[I] to cause index error if 
[15] A invalid indices: 
C16] RRI€CLDSHAPE[IIJ)C¢’I’', ¢CIND[I]] 
C17] A Branch if origin is 1: 
[18] DIOpLOoP1 
[19] aA Continue computing RRI (N iterations for 
[20] A N-way reduction): 
[21] LOOPO0:>(NsI¢I+1)oENDLP 
C22] RRI©CLDSHAPE[II])CC¢'I’, $CINDEI1])]1+DSHAPE[I1xRRI 
[23] ~~»LOOPO 
[24] LOOP1:9(N<I¢I+1) oENDLP 
[25] RRI€CLDSHAPE(TI)(CCe#’I’,éCINDLIJ])1+DSHAPE[I]xRRI+ 71 
[26] »LOOP1 
[27] A Determine unique elements of RRI: 
[28] ENDLP: GRADE¢ARRI 
[29] SORTED¢RRI( GRADE] 
[30] LAST¢SORTED#10SORTED 
[31] LASTCCxXeLAST) oe (CeLAST)-*~OTOI]¢e1 
[32] URRI¢LAST/SORTED 


-342- 


Chapter 7 Solutions FREQUENCY COUNTS, ACCUMULATIONS AND CROSS~TABS 


V MAXRED (continued) 
[33] A Reorder ARRAY to conform with SORTED: 
[34] M¢DIM-OIO 
[35] ARRAY¢¢’/ARRAYE’ ,(Mo's;’),'GRADE’,CCRANK-M+1)0’3;'),’]’ 
C36] A Perform partitioned reduction: 
C371] DIF¢Cl/CDIMJARRAY)-L/{DIMJARRAY 
C38] DIF€CADIM=tLRANK)&DIFo.x+\ 1O9LAST 
C39] DIF¢CLAST/CDIMI[T\CDIMJARRAY+DIF)-LAST/CDIMI]DIF 
[40] A Initialize result. Fill with identity elt. 
[41] A Ravel the DSHAPE dim.s: 
C42] R€eC€C-M)0(x/DSHAPE) ,1J/MPopARRAY) ol /1i0 
[43] A Insert result of partitioned reduction: 
[44] ¢’RE’,(Me's3'),’'URRI’,(CRANK-Mt+1)0’5'),’J€DIF' 
[45] A Reshape to desired shape (unravel DSHAPE dim.s): 
[46] R¢C€C-M)ODSHAPE,11M®pARRAY)o0R 
V 


CWSID: REDUCE] 
V R¢L MINRED ARRAY;CIND;DIF;DIM;DSHAPE ;GRADE;1;LAST;M;:N; 
RANK;RRI;3;SORTED;URRI 
[1] a No. of ways for N-way reduction: 
[2] Nel(e,L)+2 
[3] aA Which dimension to be "reduced"? 
C4] DIMC1TCCN+N) LL), 1TLRANK*¢epeARRAY 
[5] a Branch unless O-way reduction: 
[6]  %CxN)JeL1 
C7] R¢L/CDIMIJARRAY 
C8] >0 
[9] a Separate left arg into its pieces: 
[10] L1:DSHAPE¢NoL 
[11] CIND¢NeNLL 
[12] a Begin to compute "raveled result indices": 
[13] I<«DIO 
[14] A Index from .tDSHAPE[I] to cause index error 
C15] a if invalid indices: 
C16] RRI€CULDSHAPELII)(¢@'I’ ,éCIND[I]] 
[17] A Branch if origin is 1: 
[18] %OI0pLOOP1 
[19] A Continue computing RRI (N iterations for 
[20] A N-way reduction): 
C21} LOOPO:7(N<I€¢I+1)oENDLP 
[22] RRI€CLDSHAPELCIIJICCe’I’ ,¢CIND[I])]1+DSHAPECI]xRRI 
[23] >LOOPO 
[24} LOOP1: >(N<I¢1I+1)oENDLP 
[25] RRI€CLDSHAPE(CTIICCe’I’ ,séCIND(CIJ)1+DSHAPECIIJxRRI+ 1 
[26] »>LOOP1 
[27] A Determine unique elements of RRI: 
[28] ENDLP: GRADE¢CARRT 
C29] SORTED<¢RRI[ GRADE ] 
[30] LAST<¢SORTED#10SORTED 
[31] LASTI(xpLAST)o(eLAST)-~OIO1¢1 
[32] URRI©€LAST/SORTED 
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V MINRED Ccontinued) 
[33] A Reorder ARRAY to conform with SORTED: 
[34] M¢DIM-OIO 
[35] ARRAY¢?’ARRAY['’,(Mo’;’'),'GRADE’ ,( CRANK-M+1)0';'),'1]' 
[36] A Perform partitioned reduction: 
[37] DIF¢«Cf/CDIMJARRAY)-L/CDIMJARRAY 
[38] DIF¢CADIM=tRANK) &DIFo.x+\ 1O0LAST 
[39] DIF¢CLAST/CDIMIL\CDIMJARRAY-DIF)+LAST/CDIMI]DIF 
[40] aA Initialize result. Fill with identity elt. 
[41] aA Ravel the DSHAPE dim.s: 
[42] R¢eC€C(-M)%(x/DSHAPE) ,1/M%pARRAY)JoeL/tO 
[43] A Insert result of partitioned reduction: 
[44] ¢@'RE',(Me’s3'),'URRI’,CCRANK-Mt+1)0’3'),'1¢DIF’ 
[45] A Reshape to desired shape (Cunravel DSHAPE dim.s): 
[46] Re((-M)®DSHAPE,11/M¢oARRAY) oR 
V 


C[WSID: REDUCE] 
V R¢L ANDRED ARRAY ;CIND;CUM;DIM;DSHAPE;GRADE;1I;LAST:M;:N; 
RANK; RRI;SORTED; URRI 
[1] aA No. of ways for N-way reduction: 
[2] Nelle,L)+2 
C3] aA Which dimension to be "reduced"? 
C4] DIM¢C1TCCN+N)I LL), “1 TLRANK<e op ARRAY 
[5] a Branch unless O-way reduction: 
C6] >(XN)JoeL1 
C7] R¢A/CDIMJARRAY 
[8] >0 
[9] aA Separate left arg into its pieces: 
[10] L1:DSHAPE+NoL 
[11] CIND¢NeNIL 
[12] A Begin to compute "raveled result indices": 
C13] I<OIoO 
[14] A Index from tDSHAPE[I] to cause index error 
[15] a if invalid indices: 
[16] RRI¢(C.DSHAPE(CII]){Ce’I’ ,¢CIND[1I]] 
[17] A Branch if origin is 1: 
[18] 0I0pLOOP1 
C19] A Continue computing RRI (N iterations for 
[20] A N-way reduction): 
[21] LOOPO:7(N<I¢I+1)oENDLP 
[22] RRI¢CLDSHAPEL[TII)ICCe#'’I’ ,sCIND[I]1)1+DSHAPELI1xRRI 
[23] -»>LOOPO 
[24] LOOP1: >(N<I¢I+1)pENDLP 
C25] RRI¢C.UDSHAPE[TI)(CCe'I’ ,sCIND[I])1+DSHAPE[I1]1xRRI+ 71 
[26] ~»LOOP1 
[27] A Determine unique elements of RRI: 
[28] ENDLP: GRADE€ARRI 
[29] SORTED¢RRI[C GRADE] 
[30] LAST¢SORTED#10SORTED 
[31] LAST((xeLAST)o( ep LAST) -~010]¢1 
[32] URRI¢LAST/SORTED 
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V ANDRED (continued) 
[33] A Reorder ARRAY to conform with SORTED: 
[34] M¢DIM-OIO 
[35] ARRAY¢?@’/ARRAY[’,(Mo'’3’),’GRADE’ ,(CRANK-M+1)0'’;'),'1’ 
[36] a Perform partitioned reduction (note: A/ €3 ~v/~): 
[37] CUM+¢LAST/(CDIM1]+\€DIM]~ARRAY 
C38] CUM<¢CUM=CeCUM)TO, [DIM]JCUM 
[39] A Initialize result. Fill with identity elt. 
[40] A Ravel the DSHAPE dim.s: 
[41] R¢eC€C€-M)%0(x/DSHAPE) ,1)/M®oARRAY)oe1 
[42] a Insert result of partitioned reduction: 
C43] ¢#’'RO’,(Mo’s3’),'URRI’,(CCRANK-M+t+1)0'3’),'’J€CUM’ 
[44] A Reshape to desired shape (unravel DSHAPE dim.s): 
[45] R¢CC-M)®DSHAPE,11M%oARRAY) oR 
V 


CWSID: REDUCE] 
V R¢€L ORRED ARRAY ;CIND;CUM; DIM; DSHAPE ;GRADE;1I;LAST;M;N;3 
RANK;RRI3;SORTED;URRI 
[1] a No. of ways for N-way reduction: 
[2] NeLlLCe,L)+2 
[3] aA Which dimension to be "reduced"? 
[4] DIM¢1itC(N+N)JL), ~1TLRANK¢o9ARRAY 
[5] aA Branch unless O-way reduction: 
[6]  >(xXNJeL1 
C7] R¢v/LDIM]ARRAY 
[8] >0 
[9] aA Separate left arg into its pieces: 
[10] L1:DSHAPE¢NoL 
[11] CIND¢NoNIL 
[12] A Begin to compute "'raveled result indices": 
C13] I<«QI0 
[14] aA Index from tDSHAPE[I] to cause index error 
[15] Aa if invalid indices: 
[16] RRI€CLDSHAPE[TI)C¢'I’ ,¢CIND([1I]] 
[17] A Branch if origin is 1: 
[18] 2%OIOeLOOP1 
[19] A Continue computing RRI (N iterations for 
[20] A N-way reduction): 
[21] LOOPO:>(NsI¢I+1) oENDLP 
C22] RRI€CUDSHAPELTI)I(CCe’I’ ,séCIND(I])]1+DSHAPE[LI1]xRRI 
[23] ~»>LOOPO 
[24] LOOP1:7(N<I¢I+1)  eENDLP 
[25] RRI€CU.DSHAPE(CIIJI(Ce’I’' ,téCINDLEI1)]1+DSHAPE[IJXRRI+ 71 
[26] »LOOP1 
[27] A Determine unique elements of RRI: 
[28] ENDLP: GRADE€4RRI 
[29] SORTED¢RRIC GRADE] 
[30] LAST¢SORTED#19SORTED 
[31] LASTL[(xeLAST)e(eLAST)-~OI01¢1 
[32] URRI¢LAST/SORTED 
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[33] 
[34] 
[35] 
[36] 
C37] 
C38] 
[39] 
[40] 
C41] 
C42] 
[43] 
C44] 
C45] 


C1] 

C2] 

C3] 

C4] 

[5] 

C6] 

C7] 

[8] 

[9] 

[10] 
[11] 
[12] 
C13] 
C14) 
C15] 
[16] 
[17] 
C18] 
[19] 
[20] 
[21] 
[22] 
C23] 
[24] 
[25] 
[26] 
C27] 
[28] 
[29] 
C30] 
[31] 


VY ORRED (continued) 
A Reorder ARRAY to conform with SORTED: 
M¢DIM-DIO 
ARRAY¢¢?’'ARRAY[',(Mo’3;'),'GRADE’ ,(CRANK-Mt+1l)o0';'),'1]' 
A Perform partitioned reduction: 
CUM¢LAST/CDIMIJ+\ECDIMJARRAY 
CUM¢CUM# (oCUM) TO, LDIM]CUM 
a Initialize result. Fill with identity elt. 
a Ravel the DSHAPE dim.s: 
R¢(€C-M)0Cx/DSHAPE) , 11M®eARRAY ) 00 
A Insert result of partitioned reduction: 
®'’RE’',(Mo’3'),’URRI’,(CCRANK-M+1)0’3'),' J¢€CUM’ 
A Reshape to desired shape (unravel DSHAPE dim.s): 
R¢(€-M)ODSHAPE,1J/M®epARRAY) oR 
V 


CWSID: REDUCE] 
V R€L APLUSRED ARRAY;CIND;CUM; DIM; DSHAPE ;GRADE;1I;LAST;M; 
N;RANK;RRI;SORTED;URRI 
A No. of ways for N-way reduction: 
Nel Co, L)+2 
A Which dimension to be "reduced"? 
DIM¢C1ITCCN+N) LL) ,OIOl 1TLRANK¢p opARRAY 
A Branch unless O-way reduction: 
>*(xNJeL1 
ARRAY¢+/CDIMJIARRAY 
A Construct milky-way result: 
R¢( (opARRAY) , (DIM-ODIO) ,N, pARRAY) , , ARRAY 
>0 
A Separate left arg into its pieces: 
L1: DSHAPE¢NoL 
CIND¢NoNIL 
A Begin to compute "raveled result indices": 
I¢OIO 
A Index from .LDSHAPE[I] to cause index error if 
A invalid indices: 
RRI¢CUDSHAPE(CIIJ)(2’I’,é¢CIND[I]] 
A Branch if origin is 1: 
»O0IOeLOOP1 
A Continue computing RRI (N iterations for N-way 
A reduction): 
LOOPO: 9 (Ns=I¢1I+1)  eENDLP 
RRI¢( LDSHAPELTII(CCe’'’I’,séCINDLIJ) J+DSHAPECI1]XRRI 
>LOOPO 
LOOP1:9(N<I¢I+1) oENDLP 
RRI©CUDSHAPELIIJ)CCe’I’ ,fFCIND[I])]+DSHAPECI]xXRRI+71 
>LOOP1 
RA Determine unique elements of RRI: 
ENDLP: GRADE¢ARRI 
SORTED¢RRIC GRADE] 


-346- 


Chapter 7 Solutions FREQUENCY COUNTS, ACCUMULATIONS AND CROSS-TABS 


VY APLUSRED (continued) 

C32] LAST¢SORTED#10SORTED 

[33] LASTL(xpLAST)o(pLAST)-~OIOI¢€1 

[34] URRI€LAST/SORTED 

[35] A Branch unless ARRAY a scalar (1.e. freq count): 

[36] 3(xRANK)eL3 

[37] a Perform partitioned frequency count: 

[38] CUM¢LAST/tpLAST 

[39] CUM€CUM-(eCUM)e(~1+01I0),CUM 

[40] A Branch if ARRAY is 1 Cusually): 

[41] 3(1=ARRAY)oL2 

[42] A Multiply freq. counts by scalar: 

[43] CUM<ARRAYxCUM 

[44] a Construct milky-way result: 

[45] L2:ReC(1 0 ,N,CoeCUM) ,DSHAPE) ,URRI-DIO),CUM 

[46] 0 

[47] A Reorder ARRAY to conform with SORTED: 

C48} L3:M¢DIM-DIO 

[49] ARRAY¢¢’/ARRAY[' ,(Mo'3;'),'GRADE’,(CRANK-Mt+1)0’;'),'1' 

[50] a Perform partitioned reduction: 

[C51] CUM¢LAST/([DIMIJ+\CDIMJARRAY 

C52] CUM¢CUM-CeCUM)TO, C(DIM]CUM 

[53] A Construct milky-way result: 

[54] R¢eCCRANK,M,N, CeCUM) , DSHAPE) ,URRI-OIO),,CUM 
V 


CWSID: REDUCE] 
V R¢L AMAXRED ARRAY;CIND;DIF;DIM;DSHAPE;GRADE;1I;LAST;M;N 
;RANK;RRI;SORTED;URRI 
C1] a No. of ways for N-way reduction: 
[2] Nel(e,L)+2 
[3] a Which dimension to be "reduced"? 
[4] DIM¢C1ITCCN+N) LL) ,OLTOl 1TLRANK<p pARRAY 
[5] a Branch unless O-way reduction: 
[6] >(xXNJeLl 
C7] ARRAY¢I /£ DIMJARRAY 
C8] aA Construct milky-way result: 
[9] R¢( CopARRAY) , (DIM-OIO) .N, epARRAY) , , ARRAY 
C10} 30 
[11] A Separate left arg into its pieces: 
[12] L1:DSHAPE¢NoL 
[13] CIND¢NeNLL 
[14] A Begin to compute "raveled result indices": 
C15] I€¢0ToO 
[16] A Index from tDSHAPE[I] to cause index error if 
[17] A invalid indices: 
[18] RRI€C.DSHAPE[IIJ)ICe'I’ ,éCIND(I]] 
[19] a Branch if origin is 1: 
[20] »0I0eLOOP1 
[21] A Continue computing RRI (N iterations for 
[22] A N-way reduction): 
[23] LOOPO:9(NsI¢I+1)oENDLP 
[24] RRI€CUDSHAPE[II)(Ce'I' ,sCIND[IJ)1+DSHAPELI1]XRRI 
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VY AMAXRED (continued) 
[25] »>LOOPO 
[26] LOOP1:>(N<I¢I+1) eENDLP 
[27] RRI€C(tDSHAPELIIJ)(Ce#'I’ ,sgCIND[EI])1+DSHAPECI]XRRI+ 1 
[28] 7»LOOP1 
[29] aA Determine unique elements of RRI: 
[30] ENDLP: GRADE©4RRI 
[31] SORTED¢RRI[ GRADE ] 
[32] LAST¢SORTED410SORTED 
[33] LASTL(xeLAST)o(eLAST)-~OI0OJ]¢1 
[34] URRI¢LAST/SORTED 
[35] aA Reorder ARRAY to conform with SORTED: 
[36] M€DIM-OIO 
[37] ARRAY¢?’ARRAY[’,(Mo'’;'),'GRADE’,(CRANK-M+1)0e'3'),']' 
[38] a Perform partitioned reduction: 
C39] DIF¢Cl/CDIMJARRAY)-L/CDIMJARRAY 
C40] DIF¢«CADIM=LRANK) &DIFo. x+\ 1O%LAST 
C41] DIF¢CLAST/CDIMIT \CDIMJARRAY+DIF)-LAST/(CDIMIDIF 
[42] A Construct milky-way result: 
[43] ReCCRANK,M,N, CeDIF),DSHAPE) ,URRI-O1I0),,DIF 
V 


CWSID: REDUCE] 
V R€L AMINRED ARRAY;CIND;DIF;DIM;DSHAPE ;GRADE;1I;LAST;M;N 

;RANK;RRI;SORTED;URRI 
[1] aA No. of ways for N-way reduction: 
[2] NeL(e,L)+2 
[3] aA Which dimension to be "reduced"? 
[4] DIM¢C1TCCN+N) LL) ,OIOF “1TURANK«¢o epARRAY 
[5] a Branch unless O-way reduction: 
[6] 2 CxN)eL1 
[7] ARRAY¢L/{DIMJARRAY 
C8] aA Construct milky-way result: 
[9] R€CCeeARRAY), (DIM-ODIO),N, eo ARRAY), ,ARRAY 
[10] 0 
[11] A Separate left arg into its pieces: 
[12] L1:DSHAPE¢NoL 
[13] CIND¢NoNIL 
[14] aA Begin to compute "raveled result indices": 
C15} I¢OIo 
[16] A Index from tDSHAPE[I] to cause index error if 
[17] A invalid indices: 
[18] RRI<CtDSHAPE(TIJ)Ce’I’ ,sCIND[I]] 
[19] aA Branch if origin is 1: 
[20] 2%DIOepLOOP1 
[21] a Continue computing RRI (N iterations for 
[22] A N-way reduction): 
[23] LOOPO:»(Ns<I¢I+1)o0ENDLP 
[24] RRI¢CUDSHAPE[TIICCe'I’,tCIND[LI])]1+DSHAPELI]xRRI 
[25] ~»LOOPO 
C26] LOOP1:>(N<I¢I+1) oENDLP 
[27] RRI©CUDSHAPE(ITI)ICCe'I’,sCIND[L1])1+DSHAPE[I]xRRI+71 
[28] LOOP 
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VY AMINRED (continued) 
[29] aA Determine unique elements of RRI: 
[30] ENDLP: GRADECARRI 
[31] SORTED¢RRIC GRADE] 
[32] LAST¢SORTED419SORTED 
[33] LASTL(xXpLAST)e(oLAST)-~OI0OJ]¢1 
[34] URRI¢LAST/SORTED 
[35] A Reorder ARRAY to conform with SORTED: 
[36] M¢DIM-OIO 
[37] ARRAY¢@’ARRAY[',(Mo’3;'),’'GRADE’',(CRANK-Mt1)0'3'),']' 
[38] A Perform partitioned reduction: 
[39] DIF¢€Cl/CDIMIARRAY)-L/TDIMIARRAY 
[40] DIF¢€CADIM=LRANK)JQDIFo.x+\ “10LAST 
[41] DIF«CLAST/[DIM]JL\CDIMJARRAY-DIF)+LAST/CDIMIDIF 
C42] A Construct milky-way result: 
[43] ReC€CRANK,M,N, CeDIF) ,DSHAPE) ,URRI-O1I0),,DIF 
V 


([WSID: REDUCE] 
V R¢L AANDRED ARRAY ;CIND;CUM;DIM; DSHAPE;GRADE;I;LAST;3;M;N 

;RANK;RRI;SORTED;URRI 
[1] aA No. of ways for N-way reduction: 
[2] NeLl(e,L)+2 
[3] a Which dimension to be "reduced''? 
[4] DIMe1tC (N+N) JL) ,OLOl ~1TLRANK¢p pARRAY 
[5] aA Branch unless O-way reduction: 
[6] %C€xN)oeL1 
C7 J ARRAY€A/[TDIMJARRAY 
[8] aA Construct milky-way result: 
[9] R¢(€ CoeARRAY) , (DIM-OUIO) ,N,o ARRAY), , ARRAY 
C10] 0 
[11] A Separate left arg into its pieces: 
[12] L1:DSHAPECNoL 
[13] CIND¢NoNIL 
[14] a Begin to compute "raveled result indices": 
C15] I<€uIo 
[16] A Index from tDSHAPE[I] to cause index error if 
[17] A invalid indices: 
[18] RRI¢CULDSHAPE([(TII])Ce'I’,t¢CIND[(I]] 
[19] A Branch if origin is 1: 
[20] ~»DIOpLOOP1 
[21] A Continue computing RRI (N iterations for 
[22] A N-way reduction): 
[23] LOOPO:>(NsI¢I+1)o0ENDLP 
[24] RRI¢(tDSHAPE[CIIJI[Ce'I’ ,éCIND[1I])]1+DSHAPE[LI]XRRI 
C25]  >LOOPO 
[26] LOOP1:>(N<I¢I+1)oENDLP 
[27] RRI€CLDSHAPECTI)J(C®#’I' ,sCIND[I])1+DSHAPELIJ]xXRRI+71 
[28] -»LOOP1 
[29] a Determine unique elements of RRI: 
[30] ENDLP: GRADE€ARRI 
C31] SORTED¢RRIE GRADE] 
[32] LAST¢SORTED#10SORTED 
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V AANDRED (continued) 
[33] LASTL[(xpLAST)e(oLAST)-~OIO1¢1 
[34] URRI©€LAST/SORTED 
[35] A Reorder ARRAY to conform with SORTED: 
[36] M€¢DIM-OIO 
[37] ARRAY¢?’/ARRAYL[’,(Mo’3;’),'GRADE’ ,(CRANK-M+1)0e0'3'),')' 
[38] a Perform partitioned reduction (note: A/ €¢% ~v/~): 
[39] CUM¢LAST/([DIM]+\CDIM]~ARRAY 
[40] CUM<CUM=CeCUM)T0O,£DIM]CUM 
[41] a Construct milky-way result: 
[42] R¢CCRANK,M.N, (eCUM) ,DSHAPE) ,URRI-UIO),,CUM 
4 


CWSID: REDUCE] 
V R€L AORRED ARRAY ;CIND;CUM;DIM;DSHAPE;GRADE;1;LAST;M;:N; 
RANK;RRI;SORTED;URRI 
[1] aA No. of ways for N-way reduction: 
[2] NeL(e,L)+2 
[3] aA Which dimension to be "reduced"? 
C4] DIM¢1ItCCN+N) LL) ,OIOl “1TLRANK«p eARRAY 
[5] a Branch unless O-way reduction: 
[6] >(xN)oL1 
C7] ARRAY¢v/CDIMJARRAY 
[8] a Construct milky-way result: 
[9] R€CCoepARRAY), (DIM-OIO),N,oARRAY),,ARRAY 
[10] 0 
[11] a Separate left arg into its pieces: 
[12] L1l:DSHAPE+NoL 
[13] CIND¢NoNLL 
[14] A Begin to compute "raveled result indices": 
C15] I¢0Io0 
[16] A Index from tDSHAPE[1I] to cause index error if 
C17] A invalid indices: 
C18] RRI€CULDSHAPELCII])C¢'I’ ,éCIND([1I]] 
C19] A Branch if origin is 1: 
[20] ~DIOpLOOP1 
[21] A Continue computing RRI (N iterations for 
[22] A N-way reduction): 
C23] LOOPO:>(N<1I¢I+1)oENDLP 
[24] RRI€CLDSHAPE[I])(Ce#’I' ,¢éCIND£I])1+DSHAPE[I1]xRRI 
[25] ~»>LOOPO 
C26] LOOP1:>(N<I¢I+1)oENDLP 
[27] RRI€CLDSHAPELII)ECCe'I’ ,tgCIND[LI])]1+DSHAPE[I]xRRI+ 71 
[28] ~>LOOP1 
[29] a Determine unique elements of RRI: 
[30] ENDLP: GRADE<ARRI 
[31]  SORTED¢RRI([ GRADE] 
C32] LAST¢SORTED#410SORTED 
[33] LASTL(xeLAST)o(eLAST)-~01I01¢1 
[34] URRI¢<LAST/SORTED 
[35] a Reorder ARRAY to conform with SORTED: 
[36] M€¢DIM-OIO 
[37] ARRAY¢@’ARRAY[',(Mo';’'),’GRADE',(CRANK-Mt+1)0';'),’]’ 
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V AORRED (continued) 

[38] A Perform partitioned reduction: 

C39] CUM¢LAST/(DIM]+\CDIMJARRAY 

C40] CUM¢CUM# CeCUM) TO, [T DIM]JCUM 

[41] aA Construct milky-way result: 

[42] R€CCRANK,M.N, CeCUM) ,DSHAPE) ,URRI-OIO),,CUM 
4 


([WSID: REDUCE] 
V REW APLUSWAY ARRAY;DIM;DS;GRADE; LAST;M;N;NDS;RANK;RRI; 


S:SORTED 
[1] aA Force W to be a vector: 
C2] We ,W 


[3] a Rank; dimension reduced (origin 0); no. ways: 
C4] RANK¢ARRAYCOIO I 

[5] DIM¢QIO+M¢ARRAYCOUIO+1] 

C6] N¢ARRAYCOIO+2 ] 

[7] a Shape of reduced array; new/old resulting shape 
[8] a of reduced dimension: 

[9] S¢ARRAY[L3+ULRANK] 

[10] DS¢ARRAY[C3+RANK)+iUN] 

[11] A New resulting shape of reduced dimension: 
[12] NDS¢+DS(W] 

[13] A Branch unless O-way reduction: 

[14] 3(xN)oLO 

C15] R¢Soe(3+RANK) LARRAY 

[16] 0 

[17] mA Result indices; reduced array: 

[18] LO: RRI¢CARRAY[ (3+RANK+N)+ .LSCDIM] ] 

[19] ARRAY¢Soe(3+RANK+N+S(E DIM] ) JARRAY 

C20] A Treat special if W is empty: 

[21]  >(O#pW)eL1 

[22] Re+/CDIMJARRAY 

[23] 0 

[24] aA Treat special if W is teDS (i.e. all dimensions): 
[25] L1:%(CeW)#eDS) eL2 

[26] »(WA.=teDS)0eL3 

[27] A New result indices: 

[28] L2:RRI¢NDS1CDSTRRIICW; ] 

[29] A Determine unique elements of RRI: 

[30] GRADE¢CARRI 

[31J SORTED¢RRICT GRADE] 

[32] LAST¢SORTED#Z10SORTED 

[33] LAST (xpLAST)o(eLAST)-~OI0OIJ¢1 

{34] RRI©LAST/SORTED 

[35] A Reorder ARRAY to conform with SORTED: 

[36] ARRAY¢¢’ARRAYL',(Mo’;'),'GRADE’,((RANK-M+1)Jo'3;'),']! 
[37] A Perform partitioned reduction: 

[38] ARRAY¢LAST/(CDIM]+\CDIMIARRAY 

[39] ARRAY©ARRAY-(CoeARRAY) TO, C(DIMJARRAY 

[40] A Initialize result. Fill with identity elt. 
[41] A Ravel the DSHAPE dim.s: 

[42] L3:R¢(C-M)OCxX/NDS) ,1J/M®oARRAY) 00 
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[43] 
[44] 
C45] 
[46] 


C1] 

[2] 

C3] 

C4] 

C5] 

C6] 

C7] 

C8] 

[9] 

[10] 
C11] 
C12] 
C13] 
[14] 
C15] 
[16] 
C17] 
C18] 
[19] 
[20] 
[21] 
[22] 
[23] 
[24] 
[25] 
[26] 
[27] 
[28] 
[29] 
[30] 
Es 
C32] 
C33] 
[34] 
[35] 
[36] 
[37] 
[38] 
[39] 
[40] 
C41] 
[42] 
[43] 
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VY APLUSWAY (continued) 


Aa Insert result of partitioned reduction: 
@'RO',(Mo'’;'),’'OIO+RRI’,((CRANK-M+1)0';'),’J¢ARRAY’ 
A Reshape to desired shape (unravel NDS dim.s): 
R¢((C-M) NDS , 1J/M®oARRAY) oR 
Vv 


CWSID: REDUCE] 
V R¢W AMAXWAY ARRAY;DIF;DIM;DS;GRADE;LAST;M;N;NDS; RANK; 

RRI;S;SORTED 

A Force W to be a vector: 

We ,W 

A Rank; dimension reduced Corigin 0); no. ways: 
RANK¢ARRAYCOIO] 

DIM¢0I0+M¢ARRAY [COIO+1) 

N¢ARRAY [OIO0+2 J 

A Shape of reduced array; new/old resulting shape 

Aa of reduced dimension: 

S¢ARRAY[3+ULRANK] 

DS¢ARRAYL[L (3+RANK)J+1UN] 

A New resulting shape of reduced dimension: 
NDS¢DS([W] 

A Branch unless O-way reduction: 
+>(XN)eLO 

R¢So (3+RANK) JARRAY 

>0 

Aa Result indices; reduced array: 
LO: RRICARRAY[ (3+RANK+N)+LSCDIMI] 
ARRAY¢Soe (3+RANK+N+SCDIM])JARRAY 
A Treat special if W is empty: 
>CO#eW)eL1 

R¢f /CDIMJARRAY 

+0 

A Treat special if W is teDS Ci.e. 
L1:3€CeW)#o0DS) eL2 

>*(CWA.=LeDS)eL3 

A New result indices: 
L2:RRI€NDS1CDSTRRIICW; ] 

A Determine unique elements of RRI: 
GRADE¢4RRI 

SORTED¢RRI( GRADE] 

LAST¢SORTED# 19SORTED 

LAST (Xp LAST) oe Ce LAST) -~OI0j]¢1 
RRI©<LAST/SORTED 

A Reorder ARRAY to conform with SORTED: 
ARRAY¢#’ARRAY[’,(Mo’;'),’GRADE’,(CRANK-M+1)0':'),']' 
A Perform partitioned reduction: 

DIF¢(f /CDIMJARRAY)-L/CTDIMJARRAY 
DIF¢CADIM=ULRANKIJQDIFoO. X+\ “1OLAST 

ARRAY¢CLAST/ (DIMI! \EDIMJARRAY+DIF)-LAST/CDIMIDIF 
Aa Initialize result. Fill with identity elt. 

A Ravel the DSHAPE dim.s: 
L3:R¢((-M)O(x/NDS) ,1/M®oARRAY) of /10 


all dimensions): 
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C44] 
[45] 
[46] 
C47] 


C1] 
[2] 
[3] 
C4] 
[5] 
C6] 
C7] 
C8] 
[9] 
C10] 
C11] 
C12] 
C13] 
C14] 
C15] 
[16] 
C17] 
[18] 
C19] 
[20] 
[21] 
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VY AMAXWAY Ccontinued) 


A Insert result of partitioned reduction: 
@'ROE’,(Mo’s3'),'OIO+RRI’,(CRANK-M+1)0'3;'),’J©ARRAY' 
A Reshape to desired shape (unravel NDS dim.s): 
R¢(C-M)ONDS ,1/M%opARRAY) oR 
Vv 


[WSID: REDUCE] 

V Rew AMINWAY ARRAY;DIF;DIM;DS3;GRADE;LAST;M;N;NDS3; RANK; 

RRI;3;S;SORTED 

A Force W to be a vector: 

We ,W 

A Rank; dimension reduced 

RANK¢ARRAY(OIO] 

DIM¢OIO+M¢ARRAY [OIO+1) 

N¢ARRAY(OTO0+2] 

Aa Shape of reduced array; new/old resulting shape 
a of reduced dimension: 

S¢ARRAY(3+ULRANK J 

DS©ARRAY[ (3+RANK)+ iN] 

A New resulting shape of reduced dimension: 

NDS€¢DSCW] 

A Branch unless O-way reduction: 

>9(xN)eLO 

R¢Se(3+RANK) JARRAY 

>0 | 

A Result indices; reduced array: 
LO: RRI¢ARRAY[ €3+RANK+N)+LSCDIMI ] 

ARRAY€¢Spo (3+RANK+N+S(CDIM]J ) LARRAY 

a Treat special if W is empty: 

»>(0#eW)oL1 

Rel /CDIMJARRAY 

>0 

Aa Treat special if W is tpeDS Ci.e. 
L1:7>(€CeW)#4o0DS) eL2 

X9C(WA.=LeDS)JeL3 

Aa New result indices: 
L2:RRI¢NDS1CDSTRRI)IW; ] 

A Determine unique elements of RRI: 

GRADE¢ARRI 

SORTED¢RRIfC GRADE ] 

LAST¢+SORTED#10SORTED 

LASTE (xp LAST) 0 (eLAST)-~OIOI¢1 

RRI¢©LAST/SORTED 

A Reorder ARRAY to conform with SORTED: 

ARRAY<¢@ 'ARRAY[’ ,(Mo'3;'),'GRADE’ ,(€ CRANK-M+1)0'3'),'’]' 
A Perform partitioned reduction: 

DIF¢(f/[LDIM]ARRAY)-L/€DIMIARRAY 

DIF¢(C ADIM=LRANK)&DIFo.x+\ 19LAST 

ARRAY¢€CLAST/EDIMIJL\CDIMIARRAY-DIF)+LAST/[DIMIDIF 

A Initialize result. Fill with identity elt. 

A Ravel the DSHAPE dim.s: 
L3:ReCC-MJOCX/NDS) , 1JM®oARRAY Jol /10 


Corigin 0); no. ways: 


all dimensions): 
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[44] 
[45] 
[46] 
[47] 


Cij 

C2] 

C3] 

C4] 

[5] 

C6] 

C7] 

C8] 

[9] 

C10] 
C11] 
C12] 
C13] 
C14] 
C15] 
C16] 
C17] 
C18] 
£19] 
[20] 
[21] 
[22] 
[23] 
[24] 
[25] 
[26] 
C27] 
[28] 
[29] 
C30] 
C31] 
[32] 
C33] 
C34] 
[35] 
[36] 
C37] 
C38] 
C39] 
[40] 
C41) 
[42] 


V AMINWAY (continued) 


A Insert result of partitioned reduction: 
¢'’RE',(Mo’;'),’'OIO+RRI’,( (RANK-M+1)0’:;'),’ J©ARRAY' 
A Reshape to desired shape (unravel NDS dim.s): 
R¢(CC-M)JONDS ,1/M%eARRAY) oR 


CWSID: REDUCE] 


V REW AANDWAY ARRAY ;DIM;DS;GRADE; LAST;M;N;NDS;RANK;RRI;S 


; SORTED 
A Force W to be a vector: 

We ,W 
A Rank; dimension reduced (origin 0); no. 
RANK¢ARRAY[OIO] 
DIM¢UIO+M¢ARRAY[OIO0O+1) 

N¢ARRAY [ULO+2 J 
A Shape of reduced array; new/old resulting shape 
A of reduced dimension: 

S¢ARRAY(3+LRANK] 

DS©¢ARRAY[ (3+RANK)+tN] 
A New resulting shape of reduced dimension: 
NDS¢DS[W] 
A Branch unless O-way reduction: 

>(XN)eLO 

R¢So (3+RANK) JARRAY 

>0 

A Result indices; reduced array: 
LO: RRI¢ARRAY[ (€3+RANK+N)+LS([DIM]] 

ARRAY¢Soe (3+RANK+N+SC DIM] ) LARRAY 

A Treat special if W is empty: 

>*(CO#eW)eL1 

R¢A/CDIMJARRAY 

+0 

A Treat special if W is teDS (i.e. all dimensions): 
L1:>€CeW)#eDS)eL2 

>(WA.=LeDS)0L3 

A New result indices: 
L2:RRI©NDS1CDSTRRIIEIW; J 

Aa Determine unique elements of RRI: 
GRADE¢4RRI 

SORTED¢RRIE GRADE] 

LAST<SORTED#1®SORTED 

LASTL (xp LAST) o(o LAST) -~OI0]¢1 
RRI¢LAST/SORTED 
A Reorder ARRAY to conform with SORTED: 
ARRAY¢¢#'ARRAY(’,(Mo'’;’'),’'GRADE' ,(CRANK-M+1)0’3'),'1I' 
A Perform partitioned reduction: 

ARRAY¢LAST/ €DIM]+\0CDIM]~ARRAY 
ARRAY¢ARRAY=(C eARRAY) TO, [DIMJARRAY 
A Initialize result. Fill with identity elt. 
A Ravel the DSHAPE dim.s: 
L3:R€CC-M)JOCxX/NDS) ,1/M®eARRAY)e1 


ways: 
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V AANDWAY (continued) 
[43] A Insert result of partitioned reduction: 
[44] ¢@’R(',(Moe’3'),’OIO+RRI'’,((RANK-M+1)0':'),’ J©ARRAY' 
[45] A Reshape to desired shape (unravel NDS dim.s): 
[46] R¢e((-M)ONDS,1/M%oARRAY) oR 

V 


[WSID: REDUCE] 
V R¢W AORWAY ARRAY;DIM;DS;GRADE;LAST:M;:N;sNDS;RANK:RRI;S;: 


Tl] 
C2] 
C3] 
C4] 
[5] 
C6] 
C7] 
[8] 
[9] 


[10] DS€ARRAYI[ (3+RANK)+1iUN] 

[11] A New resulting shape of reduced dimension: 
[12] NDS¢«DSI(W] 

(131 A Branch unless O-way reduction: 

[14] 3(€xN)peLO 

[15] R¢So(3+RANK) JARRAY 

[16] 30 

[17] A Result indices; reduced array: 

[18] LO:RRICARRAY[ (€3+RANK+N)+tSC DIM] ] 

[19] ARRAY¢Se(03+RANK+N+SC0DIM J ) LARRAY 

[20] A Treat special if W is empty: 

[21] »CO#eW)eLI1 

C22] R¢ev/CDIMJARRAY 

[23] 0 

[24] a Treat special if W is lteDS (i.e. all dimensions): 
[25] L1:>?CCeW)#eDS)eL2 

[26] >(WA.=leDS)eL3 

[271 A New result indices: 

[28] L2:RRI¢NDS1:CDSTRRI)J(W; J 

[29] a Determine unique elements of RRI: 

C30] GRADE¢A4RRI 

[31] SORTED¢RRIC GRADE] 

[32] LAST¢SORTEDZ10SORTED 

C33] LASTCECxXeLAST)Je Co LAST)-~OI0O}¢1 

[34] RRI©LAST/SORTED 

[35] a Reorder ARRAY to conform with SORTED: 
[36] ARRAY¢¢’ARRAY(’ ,(Mo'’;'),'GRADE’,(CCRANK-M+1)0'’;'),']' 
[371 a Perform partitioned reduction: 

[38] ARRAY¢€LAST/(CDIM]+\CDIMJARRAY 

[39] ARRAY¢ARRAY# (op ARRAY) TO, CDIMJARRAY 

[40] A Initialize result. Fill with identity elt. 
[41] a Ravel the DSHAPE dim.s: 

[42] L3:R¢CC-M)OCX/NDS) ,1JM®eARRAY ) 00 


SORTED 

A Force W to be a vector: 

We ,W 

A Rank; dimension reduced (origin 0); no. 
RANK¢ARRAY[UIO] 

DIM¢UIO+M¢ARRAYLCOTO+1) 

N¢ARRAY(CUTO+2 ] 


Aa Shape of reduced array; new/old resulting shape 


A of reduced dimension: 
S¢ARRAYL3+LRANK] 
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V AORWAY Ccontinued) 
[43] a Insert result of partitioned reduction: 
[44] #'’R’,€Mo’;'),'’OIO+RRI’ ,(CCRANK-M+1)0’;’'),'jJ¢ARRAY’ 
[451 A Reshape to desired shape (unravel NDS dim.s): 
[46] R¢((-M)ONDS,1JMpARRAY)oR 

Vv 


[WSID: REDUCE] 
V REA APLUS B;ARRAY;CUM;DIM;DS;GRADE; LAST;N;RANK;RRI;S; 
SORTED;URRI ; IO 
C1] a Adds together two compressed Milky-Way arrays, 
[2] aA returning a third such array. 
[3] A 
[4] aA Return left argument if right is empty: 
[5]  7(0€pB)IL1 
[6] RA 
C7] 20 
[8] aA Return right if left is empty: 
[9] L1:%(0€pA)1L2 
[10] R¢B 
[11] 70 
C12] a Extract components from left argument: 
C13] L2:O0f0¢0 
[14] RANK<¢ACO] 
[15] DIM¢Af[1] 
[16] N¢€eA[2} 
C17] A Branch unless a O-way reduction: 
[18] (xN)eL3 
[19] R€(€C3+RANK) eA), ((3+RANK) JA) +(03+RANK) JB 
[20] 20 
[21] L3:S¢A{3+ LRANK] 
[22] DS¢A[(3+RANK)+1iN] 
[23] RRI©AL(3+RANK+N)+ USC DIM] ] 
[24] ARRAY¢Se(3+RANK+N+S(CDIMI)JIA 
[25] A Include components from right argument: 
[26] S€BL3+ LRANK] 
[27] RRI€RRI,BE(3+RANK+N)+tS[DIM]] 
[28] ARRAY¢ARRAY, [DIM]So(3+RANK+N+S[DIM])JB 
[29] A Use same logic as in APLUSRED: 
[30] GRADE¢4RRI 
[31] SORTED¢RRI{GRADE] 
[32] LAST¢SORTED#10SORTED 
[33] LASTLE(CxXpLAST) 9 ~1+eLASTI¢1 
C34] URRI€¢LAST/SORTED 
[35] ARRAY¢?’ARRAY[’,CDIMo’';'),'GRADE’ ,( C(RANK-DIM+1)e’;'),' 
7 , 
[36] CUM¢LAST/{DIM]+\CDIMJARRAY 
C37] CUM¢CUM-(CeCUM) TO, {DIM1CUM 
[38] Re(CCRANK,DIM,N, (0CUM),DS),URRI),,CUM 
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[WSID: REDUCE] 
V R¢A AMAX B;ARRAY;DIF;DIM;DS;GRADE; LAST;N;RANK;RRI55; 
SORTED;URRI;0O1I0 
[1] a Adds together two compressed Milky-Way arrays, 
[2] aA returning a third such array. 
[3] A 
[4] aA Return left argument if right is empty: 
[5]  >C0€pB)1U1 
C6] R¢A 
C7] >0 
[C8] aA Return right if left is empty: 
[9] L1:%(0€eA)LL2 
[10] R¢B 
C11] 70 
[12] A Extract components from left argument: 
[13] L2:010¢€0 
[14] RANK<ALO] 
C15] DIM¢AL1] 
[16] N¢Al2] 
[17] A Branch unless a O-way reduction: 
[18] 3CxN)oL3 
C19] R€CC3+RANK) 9A), (C(03+RANK) JA) I C3+RANK)JB 
[20] 0 
C21] L3:S¢AL3+LRANK]) 
{22] DSA C3+RANK)+1UN] 
[23] RRI©AC(03+RANK+N})+ USC DIMI] ] 
[24] ARRAY¢Soe(3+RANK+N+S( DIMI) IA 
C25] A Include components from right argument: 
[26] S¢B[3+ lLRANK] 
[27] RRI€RRI,BE(3+RANK+N) + iS(.DIM]] 
C28] ARRAY¢ARRAY, [CDIM1]1Se(3+RANK+N+S[LDIM1)JB 
[29] A Use same logic as in APLUSRED: 
C30] GRADE¢4RRI 
[31] SORTED¢RRIE GRADE] 
C32] LAST¢SORTED#10SORTED 
[33] LASTL[(xpLAST)9~1+pLASTI¢1 
[34] URRI€LAST/SORTED 
[351 ARRAY<¢#'ARRAYC’,(DIMe’;'),’GRADE’,((RANK-DIM+1)0’;'),’ 
1 f 
[36] DIF¢Cl/CDIMIJARRAY)-L/EDIMIJIARRAY 
C37] DIF¢CADIM=LRANK)&DIFo.x+\ 10LAST 
[38] DIF¢CLAST/C DIMI! \CDIMJARRAY+DIF)-LAST/[DIMI1DIF 
C39] R¢eCCRANK,DIM,N,(CoDIF),DS),URRI),,DIF 


CWSID: REDUCE] 
V ReA AMIN B;ARRAY;DIF;DIM;DS;GRADE;LAST:N;RANK;RRI:S; 
SORTED; URRI ;0I0 
Cl] aA Adds together two compressed Milky-Way arrays, 
[2] aA returning a third such array. 
[3] A 
C4] A Return left argument if right is empty: 
C5] 2(0€eB)IJL1 


OF 
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V AMIN (continued) 
[6] R¢eA 
(73 >0 
[8] a Return right if left is empty: 
[9] L1l:%(0€pA)LL2 
C10] R¢B 
C11] 30 
[12] A Extract components from left argument: 
£13] L2:0IT0€0 
C14] RANK¢ALO] 
[15] DIM¢A[1] 
[16] N¢€A2] 
[17] A Branch unless a O-way reduction: 
[18] %C(xN)oL3 
[19] R¢(C(C3+RANK) pA), 0CC3+RANK) LAI LC3+RANKIJIB 
[20] 70 
[21] L3:S¢€A[3+lRANK] 
C22] DS¢A[(3+RANK)+1N] 
[23] RRI©A[ (3+RANK+N)+tS£ DIM] ] 
[24] ARRAY¢Soe(3+RANK+N+S(CDIMI)IA 
C25] A Include components from right argument: 
[26] S¢BL3+ tLRANK] 
[27] RRI€RRI,BLC3+RANK+N)+iS(£DIMJ] 
[28] ARRAY¢ARRAY, (DIM1Se(3+RANK+N+S[EDIM])1B 
[29] A Use same logic as in APLUSRED: 
[30] GRADE¢4RRI 
[31] SORTED¢RRICGRADE] 
C323] LAST¢SORTED#10SORTED 
[33] LASTECxepLAST)0~-1+eLASTIJ¢1 
[34] URRI€LAST/SORTED 
[35] ARRAY¢?’ARRAYE’,(DIMoe’;'),'GRADE’,(CRANK-DIM+1)0’;'),’ 
J , 
C36] DIF¢Cl/CDIMIARRAY)-L/CDIMIARRAY 
[37] DIF¢CADIM=LRANK) &DIFo.x+\ -10LAST 
C38] DIF¢CLAST/CDIMIL\CDIMJARRAY-DIF)+LAST/{DIMIDIF 
[39] R¢eCCRANK,DIM,N,CeDIF),DS),URRI),,DIF 


[WSID: REDUCE] 
V R€A AAND B;ARRAY;CUM;DIM;DS;GRADE;LAST;N;RANK;RRI;S; 
SORTED; URRI ;HI0 
C1] a Adds together two compressed Milky-Way arrays, 
[2] aA returning a third such array. 
C3] A 
[4] aA Return left argument if right is empty: 
[51 2(0€eB)1L1 
[6] R¢A 
C7] >0 
C8] aA Return right if left is empty: 
[9] Li: 9(0€pA)IL2 
[10] R¢B 
[11] +30 
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V AAND (Ccontinued) 
[12] A Extract components from left argument: 
C13] L2:010¢0 
C14] RANK¢A[O] 
[15] DIM¢A[1] 
[16] N€A[2] 
C17] A Branch unless a O-way reduction: 
[18] »(xN)oL3 
[19] R€¢CC3+RANK) pA), (03+RANK)JAJAC3+RANKIJB 
[20] 30 
[21] L3:S¢A[3+URANK] 
[22] DS¢A[(3+RANK)+1UN] 
[23] RRI¢AC(3+RANK+N)+ULSCDIM]] 
[24] ARRAY¢Se(3+RANK+N+S[DIMI)IIA 
[251 a Include components from right argument: 
[26] S¢BL3+tLRANK] 
[27] RRI€¢RRI,BLEC3+RANK+N)+iLSCDIM]] 
[28] ARRAY€ARRAY, {[DIM1Se(3+RANK+N+SCDIM])JB 
[29] a Use same logic as in APLUSRED: 
C30] GRADE¢ARRI 
[31] SORTED¢RRI( GRADE] 
[32] LAST<SORTED#10SORTED 
[33] LAST[C(xpLAST)e~1+oLASTI]¢1 
[34] URRI€LAST/SORTED 
[35] ARRAY¢¢’ARRAY([’,(CDIMo';'),'GRADE’,(CRANK-DIM+1)0'3'),’ 
7 r 
C36] CUM¢LAST/(CDIM1J+\(0DIM]~ARRAY 
C37] CUM¢CUM=(CeCUM)T0, [(DIM]CUM 
[38] R¢eCCRANK,DIM,N,(CeCUM),DS),URRI),,CUM 


[WSID: REDUCE] 
V R¢A AOR B;3;ARRAY;CUM;DIM;DS;GRADE; LAST:N;RANK:RRI:S; 
SORTED ;URRI ;01I0 
[1] aA Adds together two compressed Milky-Way arrays, 
[2] aA returning a third such array. 
[3] aA 
[4] aA Return left argument if right is empty: 
C5] »*(O0€oB)LL1 
C6] RA 
C7] >0 
[8] a Return right if left is empty: 
C9] L1:>»(0€pA)1L2 
[10] R¢B 
[111] 20 
[12] A Extract components from left argument: 
[13] L2:010¢€0 
[14] RANK¢AI[0] 
[15] DIM¢AL[1] 
[16] N¢€AL2] 
[17] a Branch unless a O-way reduction: 
[18] »%C€xN)eL3 
[19] R¢(€(3+RANK) eA) ,€(3+RANK) JA) VC03+RANK) IB 
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V AOR Ccontinued) 
C20] +0 
[21] L3:S¢€AC3+LRANK] 
C22] DS¢ALC3+RANK)+iUN] 
[23] RRI€AC(3+RANK+N)+1LS€DIMI]] 
[24] ARRAY¢Se(3+RANKt+N+S([DIMI)JA 
[25] a Include components from right argument: 
[26] S¢BL3+ lLRANK] 
[27] RRI€RRI,BE(3+RANK+N)+tSC0DIM] } 
[28] ARRAY€ARRAY, C(DIM]Se(C3+RANK+N+S(CDIM1I)IB 
[293 A Use same logic as in APLUSRED: 
[30] GRADE¢ARRI 
C31] SORTED¢RRIC GRADE] 
{32] LAST¢SORTED419SORTED 
[33] LASTLE(xXeLAST)9~1+oLASTI1¢1 
[34] URRI€LAST/SORTED 
[351] ARRAY¢¢’ARRAY[’,CDIMo’;'),'’GRADE' ,(CRANK-DIM+1)0'3’'),’ 
7 ’ 
[36] CUM¢LAST/[CDIM]+\CDIM]ARRAY 
[37] CUM¢CUM4 CeCUM)TO, [DIM]JCUM 
[38] R¢CCRANK,DIM,N, (eCUM) ,DS),URRI),,CUM 


V 
6. OLT0¢+1 
I1<'’ECMPH’ rTZONE (5 classes) 
I2¢'BCPS'tuTYPE (4 classes) 
I3€+/SALES°.20 1E6 5E6 (3 classes) 
Cor: I3¢0 1E6 5E6 LIOTA SALES ) 

CLIOTA is defined in Sorting and Searching chapter) 
I4¢ALLSTATES CMIOTA STATES (50 classes) 

(CMIOTA is defined in Sorting and Searching chapter) 
I5€301 304 310 322 329 \LMGR (6 classes) 


SUM¢(5 4 3 50 6,1 2 3 4 5JAPLUSRED 1,[£11SALES,(£.5] FIT 


A Returns a 3 (frequency, SALES, FIT) by 5 by 4 by 3 by 50 
A by 6 result as a compressed Milky-Way result. 


1. €3 APLUSWAY SUM)[1;] 
or 
3 3 PLUSRED 1 


2. 4 APLUSWAY SUM 
or 
50 4 PLUSRED 1,[0C11SALES,(€.5JFIT 
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3. (5 3 2 APLUSWAY SUM)C2 33353] 
or 


6 3 4 5 3 2 PLUSRED SALES,([.5]FIT 


4. C4 2 APLUSWAY SUM)[1;;] 


50 4 4 2 PLUSRED 1 


Die (2 3 APLUSWAY SUM)L3;;3] 


4 3 2 3 PLUSRED FIT 
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WRITING USER-FRIENDLY INTERACTIVE FUNCTIONS 


[WSID: INPUT] 
VY R¢CHOICES LPROMPTE PROMPT 
Prompts for a single letter. PROMPT is the 
character vector prompt. CHOICES is a character 
vector of allowable single characters to be 
entered. R is a one element vector index into 
CHOICES of the character entered or is a numeric 
scalar escape code if an escape word is typed. 
[7] aA Requires: CPROMPTE. 
[8] L1:RECPROMPTE PROMPT 
[9] aA Exit if scalar escape code: 
[10] CpeR)LO 
C11] 2%C1=eR)eL2 
[12] O¢’*x* ENTER ONE CHARACTER ONLY ' 
f13] Ll 
[14] a Convert R to index: 
[15] L2:R¢CHOICEStR 
[16] %CR<0IO+pCHOICES) 90 
C17] O¢'’*x* INVALID CHOICE. ENTER ONE OF: ' 
[18] Oell,’,’, COLO+0.5]CHOICES 
C19] -»L1 


W 
J 
DDDDD DD 


Insert the expression 
RE CR='-')/veRIe!'~? 


in the functions NINPUT and NPROMPTE just before using OVI on R. 
Insert the APL2 expression 


(CR='-')/R)e' 7! 


in the functions NINPUT2 and NPROMPTE2 just before checking R for 
numeric characters. 
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C1] 
[2] 
C3] 
C4] 
[5] 
C6] 
C7] 


[8] 

[9] 

[10] 
[11] 
C12] 
C13] 
[14] 
[15] 
C16] 
C17] 
C18] 
[19] 
C20] 
C21] 
[22] 


C23] 
C24] 


WRITING USER-FRIENDLY INTERACTIVE FUNCTIONS 


CWSID: INPUT] 


V PROPOSAL;AGES ;NAME;NKIDS;R 


Aa Illustration of input utility functions. 

Li:70 ESCAPE NAME¢CPROMPTE 'NAME: ' 

L2:320 ESCAPE NKIDS¢1 NPROMPTE 'NUMBER OF KIDS: ' 
9L2 IFC~NKIDS€0,Ll20)MESSAGE ‘'*x* Q TO 20 KIDS ONLY’ 
»L4 UNLESSXNKIDS 

L3:70 ESCAPE AGES¢NKIDS NPROMPTE 'AGES OF KIDS: ' 
9L3 IFC CAGESYV.#fAGES) vCAGESV. <0) VAGESV.>99 J)MESSAGE 

O TO 99 AGES ONLY’ 

A Allow alignment of paper: 

L4:90 ESCAPE CPROMPTE 'PRESS ENTER WHEN READY...’ 

A 3 blank lines: 


"kk 


Oo 31 o0' ' 

Ue’Dear ',NAME,’:' 

a 1 blank line: 

Oe’! 

M<+'As a proud parent of ',(éNKIDS),’' kid’ 
O¢CC1l#NKIDSIJoe's'),’ Cwhose’ 


M<¢’average age is ',é(+/AGES)+NKIDS 
Ne’), you need insurance. ' 
O¢ 31 oe’ ' 

A Allow alignment of paper: 
90 ESCAPE CPROMPTE '' 

290 ESCAPE R¢'YN’ LPROMPTE 


? 


‘GENERATE ANOTHER PROPOSAL? 


A Do another if response is Y: 
9L1l IF R=1 
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MANIPULATING DATES 


1-1!1(CCTODAYS360 MDATES)-TODAYS360 PDATES)+180 
or 
11CCTODAYS360 PDATES)-TODAYS360 MDATES)+180 


These two expressions return different results if the purchase 

date occurs on a coupon date (1 and 0 respectively). The first 
expression is correct if the buyer receives the coupon and the 

second is correct if the seller receives the coupon. 


€1000x1.001%*-/TODAYS RDATE, BDATE)-1000 


CWSID: DATES] 
V YYYYDDD¢TOYD YYYYMMDD; DD; LEAP;MM;MMDD;YYYY;01I0 
[1] aA Converts dates (YYYYMMDD) to Julian dates (CYYYYDDD 
[2] a where DDD is number of days since prior Dec. 31). 
[3] H10¢+1 
C4] DD¢100!1YYYYMMDD 
C5] MMDD¢10000!1YYYYMMDD 
[6] MM¢ (MMDD-DD)+100 
[7] A Year and year times 1000 (e.g. 1986000): 
C8] YYYY¢CYYYYMMDD-MMDD)+10000 
[9] YYYYDDD<1000xXYYYY 
[10] a Add in days from start of month, and from start 
[11] A of year to start of month: 
C12] YYYYDDD¢YYYYDDD+DD+(C0O 31 59 90 120 151 181 212 243 273 
304 334) (MM] 
[13] A Determine whether a leap year: 
[14] LEAP¢CO=41YYYYIACOFLOOLYYYY)VO=400I1YYYY 
[15] aA Add in leap day if month is March or later: 
[16] YYYYDDD¢YYYYDDD+LEAPAMM=3 
V 
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C[WSID: DATES] 

V YYYYMMDD¢FROMYD YYYYDDD;DD;DDD;FEB29;LEAP;MM;YYYY;U01I0 
C1] a Convert Julian dates (YYYYDDD where DDD is number of 
(2] aA days since prior Dec. 31) to YYYYMMDD dates. 
[3] OTO¢] 
[4] DDD¢1000 1 YYYYDDD 
[5] a Year and year times 10000 (e.g. 19860000): 
C6] YYYY©CYYYYDDD-DDD)+1000 
C7] YYYYMMDD¢1lOOOOXYYYY 
[8] aA Determine whether a leap year: 
[9] LEAPC CO=41YYYYIACOFLOOIYYYY)VO=4001YYYY 
[10] aA Is day a leap day (i.e. Feb. 29)? 
[11] FEB29¢<LEAPADDD=60 
[12] A Subtract leap day if Feb. 29 or later to determine 

month: 
C13] DDD<DDD-LEAPADDD=260 
C14] MM¢eC31 28 31 30 31 30 31 31 30 31 30 31 /112)TDDD] 
[15] a Days since start of month: 
[16] DDe¢DDD-CO 31 59 90 120 151 181 212 243 273 304 334)CMM 
J 

[17] A Add back one day if Feb. 29: 
[18] DD«¢DD+FEB29 
C19] YYYYMMDD¢YYYYMMDD+DD+100xXMM 

V 


4. WKDAYS¢7 9o0'MONDAY TUESDAY  WEDNESDAYTHURS...SUNDAY 
WKDAY¢WKDAYS[OI0+711+TODAYS FROMQTS 300TS;] 
CWKDAY#!' ')/WKDAY 
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WRITING REPORTS 


1. H1¢15 15 3 HEADINGS 'nLAST YEARNTHIS YEAR’ 
HDG¢ ' nAVG. “SALENTOTAL¢SALESNAVG. ¢SALENTOTAL¢SALES ’ 
HDG¢HDG., 'NGROWTH¢IN¢TOTAL¢SALES ' 
HDG¢6 7 6 7 7 2 3 2 3 HEADINGS HDG 
HDG[1 2;:l11loH1]+H1 


oes A Format the date: 
FDATE¢, '’G<Z9/99/99>' OFMT DATE A APL*PLUS or SHARP APL 
FDATE¢’55/55/50'8DATE aA APL2 
FDATE¢1 101101 1\6 O&SDATE a Other APL systems 
FDATE[T3 61]¢'/' a Other APL systems 


TITLE¢€’>PAGE ',(CSPNO),'NNFINANCIAL SUMMARYN’ , FDATE 


TITLECTITLE, ’NWESTERN REGION’ 
TITLE¢65 TITLES TITLE 


3. Approach 1 (using the newline character): 
TCNL¢OTCNL A APL*APLUS 
TCNL¢ODAVO156+0I0] A SHARP APL 
TCNL¢OTCL1+0OI0O] A APL2 
TCNL¢OAVE???] A Other APL systems 


CMAT¢('4(0I5,X2),<',TCNL,’>,4F7.1') OFMT 3 8oNMAT A APL*PLUS, 
SHARP APL 


CMAT¢(€280'55550 %'),TCNL,280’ 5550.0')SNMAT A APL2 


CMAT¢((C8e7 0),807 1343 S8eNMAT a Other APL systems 
CMAT¢(O 213 28TCMAT),TCNL,3 ~28TCMAT A Other APL systems 


Approach 2 (using dyadic transpose): 


CMAT¢(602 0306 2802 1 384 6 7e(1207 0 7 1)88NMAT 
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1. The following are good candidates for "visual representation 
manipulation" functions. Their listings are not presented here 
but are available on disk. See the Postscript at the end of the 
book. 


Syntax: NEWVRCUNLAMP OLDVR 


The UNLAMP function removes all comments from the visual 
representation. Both end-of-line and full-line comments are removed 
completely, including the comment symbol (A). The function lines are 
renumbered as needed to allow for deleted full-line comments. UNLAMP 
is different from UNCOMMENT in that UNCOMMENT does not delete the 
comment symbol on full-line comments and so has no line-renumbering. 


Syntax: NEWVR¢COBFUSCATE OLDVR 


The OBFUSCATE function modifies all local identifiers (labels, result 
variable, argument variables, localized variables) by prefixing their 
names by the characters ‘AA’. In this way, the identifiers of the 
function are obfuscated so that chances of a name conflict are 
minimized when the function uses execute (¢#) to access variables or 
functions whose definitions are global to the function being 
obfuscated. WSDOC is an example of an obfuscated function. 


Syntax: NEWVR<CUNOBFUSCATE OLDVR 


The UNOBFUSCATE function deletes all occurrences of the characters 
'AA', thereby undoing the effects of OBFUSCATE. Since UNOBFUSCATE 
does not limit its seach to just local identifiers, it will delete 
even those occurrences of 'AA’ which were not inserted by OBFUSCATE 
(such as in character constants, in comments or in the middle of 
identifier names). 
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Syntax: NEWVR¢UNDIAMOND OLDVR 


The UNDIAMOND function breaks all multi-statement lines (© delimited) 
into single-statement lines, renumbering lines as needed. For 
example: 


V TEST V TEST 


C1) A¢0 © BeAt+2 O CEAtTB > f1]J A¢0 
[2] a Proceed: > C2] BeAt+2 
[3] CALC > C3] C¢A+B 
V [4] aA Proceed: 
[5] CALC 
V 


The UNDIAMOND function may be used when moving a function from an APL 
environment which supports statement separators (°) to one which does 
not; or you may find single-statement lines easier to read than 
multi-statement lines. 


[WSID: WSDOC] 
V WSDOC AAPAGE$;AAB3; AABOTTOM; AAC; AAD; AADATA;$ AADONE3 AAF; 

AAFIRST $; AAFNS 3 AAFOOT 5 AAHEIGHT 3; AAT 5 AAIND; AALAST 5 AALEFT ; 
AALEN $3; AALIM; AALINES 3; AAMARGIN ; AAN 5 AANAME 3 AANL3 
AANONDISPLAY 3; AAP; AAPNO; AAQUOTE $; AAR$3 AAS 3; AAT 3 AATCNL; 
AATITLE 3; AATOP; AATXT 3 AAVARS 5 AAVR3$ AAW 3 AAWI DTH 

[1] a Displays paged WS documentation. All output 

[21 a is via O€ so replace all O¢ by custom fn 

[3] aA (e.g. PRINT) to redirect output. PAGE: rows, 

C4] a columns, margins (top, bottom, left, right). 

C5] AATOP©CAAPAGEL2+0I0] 

C6] AABOTTOM¢AAPAGEL3+0I0] 

C7] AAHEIGHT¢AAPAGELOIO]-AATOP+AABOTTOM 

C8] AALEFT¢AAPAGEL[4+0I0] 

[9] AAWIDTH¢AAPAGEL1+01I0]-AAPAGE[5+0I0]7 

[10] aA Construct newline character: 

C11] AATCNL¢OTCNL A APL*PLUS 

C12] A TCNL€OTCL1+0T0)] A APL2 

[13] A TCNL¢ODAV(156+0OI0] A SHARP APL 

[14] A Format today’s date: 

C15} AAD¢sOTS{1 2 0 +0I0O] 


C16] AADECAAD=' ')/tpAADI€'/' 
[17] A Format the time: 
[18] AAT€CSOTSL3+0OI0I3),'’:'’, 2T'O’ ,s0TSL4+01I0) 


[19] aA Format the WSID: 

[20] AATITLE¢COWSID A APL*PLUS 

TITLE<2 OWS 1 A SHARP APL 
S€100 OSVO 'C’ aA APL2 CTSO) 

S¢0 011 O08SvVC 'C’ a APL2 C(TSO) 
C¢’')WSID’ A APL2 (TSO) 
TITLE¢CLOIO:I]~' ' @A APL2 CTSO) 


fa 
NO 
W 
Ls 
DDdDDD 
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VY WSDOC Ccontinued) 

[26] A S¢OSVR 'C’' aA APL2 (TSO) 

[27] a Provide WSID as left arg otherwise: 

[28] a TITLE<CWSID A APL2 (CMS) 

[29] A Delete leading/trailing blanks: 

[30] AATITLE€(-+/A\OAATITLE=' 'JLC+/A\AATITLE=' ')LAATITLE 

[31] A Format page title: 

[32] AATITLE¢CAATOPPAATCNL) , AAWIDTHT CAALEFT 9’ '),AATITLE,’ 
* ',AAD,’ ',AAT 

[33] A Insert page number: 

[34] AAPNO¢€1 

[35] AAT€’ PAGE 1’ 

[36] AATITLE[ C-pAATITLPAATITLEICAAT 

[37] a Build first page: 

C38] AATXT€CAATITLE, AATCNL 

[39] A Keep track of lines used so far (below 

[40] a top margin) in TXT: 

C41] AALINES€2 

[42] aA Build nondefault environment: 

C43] AAT€AATCNL, CAALEFToe'’ '),'NONDEFAULT WORKSPACE 
ENVIRONMENT: ’ ,AATCNL 

[44] AAMARGIN€(AALEFT+3)0' ' 

C45] AAQUOTE¢€’''!' 

[46] A Define chars which don’t display normally: 

[47] AANONDISPLAY¢OTCNL,OTCLF,OTCBS ,OTCBEL,OTCDEL,OTCNUL, 
OTCESC,OTCFF A APL*PLUS 

C48] A AANONDISPLAY¢OAVLOIO+0 1 156 158 159] A SHARP APL 

[49] A AANONDISPLAY¢OTC A APL2 

[50] A Format nondefaults: 

[51] a Branch if default OLX: 

[52] > CeAADATAC, OLX) LAAL2 

[53] A Replace nondisplayable chars by @: 

[541] AADATAL CAADATAEAANONDISPLAY)/ LP AADATAI€'H’ 

[55] a Double up quote chars: 

[56] AADATACAAQUOTE, (€C1+AADATA=AAQUOTE) /AADATA) , AAQUOTE 

[571 A Width available (after ' OLX’): 

C58] AAWCAAWIDTH-AALEFT+7 

[59] aA Branch if data will fit on a single line: 

[60] >CAAW2eAADATA)/AAL1 

[61] A Else truncate and show ’...': 

C62] AADATA€¢( CAAW+ 73) 0AADATA),'’...' 

[63] AAL1: AATCAAT, AATCNL, AAMARGIN, 'OLX¢€’' ,AADATA 

[64] AAL2: AATCAAT, (14010) /AATCNL, AAMARGIN, 'OI0O¢’ , s01I0 

C65] AATCAAT, €10740PP)/AATCNL, AAMARGIN, 'OPP¢<’ ,s0PP 

C66] AAT¢AAT, €1680740RL) /AATCNL, AAMARGIN, 'ORL¢’ , SORL 

C67] A Perform precise comparison for OCT: 

C68] AACceOcT 

[69] OCT<oO 

[C70] AATCAAT, CAAC#2% 46) /AATCNL, AAMARGIN, 'OCT¢€!’ , SAAC 

C71] A Use 1E°™13 instead of 2* 46 for APL2 

[72] OCT¢AAC 

[73] A Other nondefault workspace environment 

[74] A parameters which may be included are: state 

[75] A indicator, workspace size, workspace available, 
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V wSbDOC (continued) 
[76] A symbols reserved and used, error latent 
[771 Aa expression (trap definition), attention latent 
[78] A expression, etc. 
[79] A 
C80] A Include blank line after any nondefaults: 
[81] AAT¢AAT, AATCNL 
C82] A Attach nondefaults, if any, to page: 
C83] AAN¢€+/AAT=AATCNL 
[84] AANCAANXAAN>3 
C85] AATXT©AATKT, CXAAN) /AAT 
[86] AALINES<AALINES+AAN 
C87] A 
C88] A Variables: 
[89] AAVARS¢ONL 2 
[90] A Squeeze out local (WSDOC) variables: 
[91] AAVARS€CAAVARSEL; O 1 +0T0O)]V.#'A')AFAAVARS 
[92] AAMARGINCAALEFTp’ ' 
[93] A Branch if no variables: 
[94] »C1TeEAAVARS) LAAL7 
[C95] AATXT¢AATXT, AATCNL, AAMARGIN, 'GLOBAL WORKSPACE 
VARIABLES: ' ,AATCNL 
[96] AALINESCAALINES+2 
[97] A Sort variable names if not already: 
[98] A VARS¢VARS[OAVAVARS; J 
C99] A Right justify variable names: 
[100] AAT*¢AAVARS+.=!' ' 
[101] AAVARS¢(C-AAT)®AAVARS 
[102] a Drop leading all blank columns (possible by 
[103] a deleting vars): 
[104] AAVARS€(0,L/AAT) LAAVARS 
[105] A Indent variable names for left margin plus 3: 
[106] AAVARS€CCCITePEAAVARS) ,AALEFT+3)0' '},AAVARS 
C107] a Include heading: 
C108] AATCAATCNL, (CC-1LLOAAVARS)T’NAME'),' ¢€ SHAPE op VALUE’ 
> AATCNL 
C109] AATXT¢AATXT, AAT, CC-1LJeEAAVARS)T'----'),' 8 ----- = 


r 


— oe ee 


[110] AALINES¢€AALINES+2 

[111] A Loop by variable: 

C112] AAT¢OIO+ 71 

C113] AALIM¢C1TeAAVARS )-~O1I0 

[114] AAL3:7>CAALIM<AAT€CAAT+1)/AAL6 

[115] A Format shape: 

[116] AAS¢CEEAADATACP&AANAMECAAVARSEAAT31),' © ' 
[117] A Omit shape and ep if a scalar; pad to line up 
C118] A with o's: 

[119] AAS€C-11f pAASITC3<eAAS)/AAS 

[120] A Combine name and shape; compute remaining width: 
[121] AAS€AANAME,' €',AAS 

[122] AAWCAAWIDTH-peAAS 

[123] a Branch if a numeric variable: 

[124] +CO=1TOPAADATA) CAALA4 
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V 
C125] 
[126] 
[127] 
C128] 
[129] 
[130] 
[131] 
[132] 
[133] 
C134] 
[135] 
C136] 
[137] 
C138] 
[139] 
[140] 
C141] 
C142] 
C143] 
[144] 
C145] 
[146] 
C147] 
[148] 
[149] 
C150] 
[151] 
[152] 
C153] 
[154] 
[155] 
[156] 
[157] 
C158] 
[159] 
[160] 
[161] 
[162] 
C163] 
[164] 
[165] 
[166] 
[167] 
C168] 
[169] 


[170] 
C171] 


[C172] 


C173] 


WSbDOC (continued) 
a Else data is character; consider only up to 
A W chars: 
AADATA€ ( AAWL X/ 9 AADATA) pAADATA 
A Replace nondisplayable chars by &: 
AADATAL CAADATAEAANONDISPLAY) /LPAADATAI€' BH’ 
A Double up quote chars: 
AADATACAAQUOTE, (€C1+AADATA=AAQUOTE ) /AADATA) , AAQUOTE 
a Branch if data will fit on a single line: 
> (CAAW2eAADATA) /AAL5 


A Else truncate and show ’...'’: 
AADATA€ ( CAAW+ 3) @AADATA),'’...! 
2AALS5 


Aa If data is numeric, 
A 1+W+2 elements: 
AAL4: AADATA€( (14+I AAW+2)LX/oAADATA) pAADATA 
Aa Format numbers to 10 digits (CLEAR WS default): 
AAP¢OPP 
OPP<10 
AADATAfCEAADATA 
OPP¢AAP 
A Format value from empty array as 10: 
AADATACAADATA, CO=eAADATAI)/' 10! 
Aa Branch if data will fit on a single line: 
> ( AAW2 0 AADATA)/ AALS 
A Else truncate at last space within width and 
A show '...’: 
AADATA€ CC+/V\! 
Q 
A Append variable definition to page: 
AALS: AATXT©¢AATXT, AATCNL, AAS, AADATA 
AALINES©€AALINES+1 
A Branch for more vars unless bottom of page: 
> (CAALINES<AAHEIGHT)/AAL3 
A Display page: 
USAATXT, CAABOTTOM+ AAHETGHT~AALINES ) pPAATCNL 
A Format new page: 
AATXT©CAATITLE, AATCNL 
AAPNO¢AAPNO+1 
AAT€' PAGE ' ,SAAPNO 
AATXTLE (-pAAT) TLPAATITLEICAAT 
AALINES€2 
a Branch if no more vars: 
>CAATZ2AALIM) /AAL7 
A Else insert new heading: 
AATXT©CAATXT, AATCNL, AAMARGIN, 'GLOBAL WORKSPACE 
VARIABLES CCONT.):' ,AATCNL 
AALINES¢AALINES+2 
AAT¢CAATCNL, CC-1LeAAVARS)T'NAME'),!’ € 
> AATCNL 
AATXTEAATXT, AAT, CC-~1LeAAVARS )T'----'),’ 


f 


consider only up to 


'=0CAAW+ 3) ePAADATA) PAADATA),'...' 


AALINES¢AALINES+#2 
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C201] 
[202] 
[203] 
[204] 
[205] 
[206] 
[207] 
C208] 
[209] 
[210] 
[211] 
C212] 
[213] 
[214] 
C215] 
C216] 
[217] 
C218] 
{219] 
C220] 
C221] 


[222] 
C223] 
C224] 
[225] 


WORKSPACE DESIGN AND 


WSDOC (continued) 
Aa Branch for more vars: 
2AAL3 
AQ 
A No more vars Cinclude blank line): 
AAL6: AATXTCAATXT, AATCNL 
AALINES¢€AALINES+1 
A 
A Functions: 
AAL7: AAFNS¢CONL 3 
A Squeeze out this (WSDOC) function: 
AAFNS¢ CAAFNSYV.#C1LIeEAAFNS)T'WSDOC' )#AAFNS 
A Use next, not prior, line if CRAVR is used below: 
A FNS¢CA/FNSV.#802,1LoFNSIT2 Se 'WSDOCCRAVR’ )#FNS 
A 
a Branch if some fns: 
> ( XAALIM¢1T PAAFNS ) /AAL8 
A Exit if nothing on page: 
> (CAALINES=2)/0 
A Else display page and exit: 
O<AATXT, CAABOTTOM+ AAHEIGHT-AALINES ) pAATCNL 
30 
A Branch if at least 4 lines left on page: 
AAL8: >CAALINES SAAHEIGHT+7~ 4) /AAL9 
A Else display page: 
O<AATXT, CAABOTTOM+ AAHEIGHT-AALINES ) pPAATCNL 
A Format new page: 
AATXT¢CAATITLE , AATCNL 
AAPNO©CAAPNO+1 
AAT¢' PAGE ' ,¢éAAPNO 
AATXTL (-pAAT) TLCPAATITLEICAAT 
AALINES€2 
AQ 


AALY: AATXTCAATXT , AATCNL, AAMARGIN, '’ FUNCTIONS: ' , AATCNL 


AALINES¢CAALINES+2 

A Sort fn names if not already: 

A FNS¢FNSCOAVAFNS; J 

A 

Aa Pad fn names with 3 leading blank columns: 


AAFNS€(O ~3 -oeAAFNS)TAAFNS 
A How many "columns" of fn names will fit across pg? 
AAW¢11o0AAFNS 


AAN¢L AAWIDTH+AAW 

A How many rows will this take? 

AAR¢T AALIM+AAN 

A Pad bottom of fn list for subsequent reshape: 
AAFNS€( CAARXAAN) , AAW) TAAFNS 

A Shuffle fn list to get names in desired order: 
AAFECCAAR,AALEFT)0' '),CAAR,AANXAAW)0COIO+ 1 0 2)&C 
AAN, AAR, AAW) pAAFNS 

A How many will fit on this page? 
AAL10: AAN€ CLO PAAF) L AAHETGHT-AALINES 

A Stick them on: 
AATXT¢CAATXT, , AATCNL, CAAN, 1LoAAF) TAAF 
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V wsboc (continued) 
[226] AALINES¢AALINES+AAN 
[227] A Drop them off: 
[228] AAF¢CAAN,O)LJAAF 
[229] a Branch if none left: 
[230] > C1ITPAAF)LAAL11 
[231] A Display page: 
[232] OCaAaATXT, AABOTTOMeAATCNL 
[233] A Format new page: 
[234] AATXT¢CAATITLE, AATCNL 
[C235] AAPNO¢CAAPNOt1 
[236] AAT€'PAGE ' ,SAAPNO 
[237] AATXTLEC-pAAT)TLPAATITLEI¢AAT 
C238] AALINES€2 
[239] AATXT€AATXKT, AATCNL, AAMARGIN, '’FUNCTIONS CCONT.):'’, 
AATCNL 
[240] AALINES©AALINES+2 
[241] A Put remaining fns on this page: 
[242] ~»AAL10 
[243] A 
[244] Aa Include 5 blank lines after fn list: 
C245] AAL11: AAN€C5LAAHEIGHT-AALINES 
[246] AATXT¢CAATXT, AANOAATCNL 
[247] AALINES©AALINES+t+AAN 
[248] A 
[249] a Loop by function (keep track of ist one for 
[250] a footnote): 
[251] AAFIRST¢AAFNS(OIO; J 
[252] AAFIRST¢CAAFIRST#! ')/AAFIRST 
[253] AALAST¢'! 
[254] AAI€OIO+ 1 
[255] AALIM€AALIM-~QOIO 
[256] AAL12: > (AADONECAALIM<AAT©CAAT+1)/AAL18 
[257] AANAMECAAFNS[AAT;] 
[258] AANAME<CAANAME#!’ ')/AANAME 
[259] AAVRCOVR AANAME A APL*APLUS 
C260] A VR¢1 OFD NAME aA SHARP APL 
[261] A VR€CRAVR OCR NAME aA APL2 
[262] aA Get next fn if this one locked: 
[263] > CepAAVRILAALI2 
[264] A Find newline characters: 
[265] AANL€ECAAVR=AATCNL) /LEAAVR 
[266] A Lengths of lines: 
[267] AALEN€ 1+AANL- 1) C00I0+°1),AANL 
[268] aA Branch if no lines too long: 
C269]  %*CV/AATCAALEN>AAWLDTH-AALEFT) LAAL15 
[270] A Retain starting indices and lengths of too 
[271] a long lines: 
[272] AANLEAAT/ “11 C0I0O+~1),AANL 
[273] AALENCAAT/AALEN 
[274] a Flag chars which shouldn’t be broken when 
[2751 a contiguous: 
[276] AABCAAVRE 'ABCDEFGHIJKLMNOPORSTUVWXYZAabcdefghijklmnop 
qrstuvwxyzA0123456789. 0!’ 
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wWSbDOC (continued) 
a Break points exist everywhere but where these 
A chars occur and are followed by another one of 
A these chars: 
AABCAABALDAAB 
Aa Indices of break chars found so far: 
AAIND€1t0 
RA Relative indices of first WIDTH chars 
A following known breaks: 
AAR€(C*~0I0)+®LAAWIDTH-AALEFT 
A Displacement to next break points: 
AAL13: AAD€+/V\AABCAANLOC.+AAR] 
A Use full length if break point under 1/2 line: 
AAD CAAD<(CePAAR)+2)/LEAADIC PAAR 
A Update vars for new break points: 
AANLEAANL+AAD 
AAIND¢€AAIND, AANL 
AALEN¢AALEN-AAD 
A Any lines still too long (Caddl 6 space indent)? 
Aa Branch if not: 
AAR¢CAAWIDTH-AALEFT+6 
AAT¢AALEN>AAR 
AALEN¢AAT/AALEN 
+(e AALEN) LAAL14 
AANL€AAT/ AANL 
AAR€(~+0)IO)+OL AAR 
A Repeat: 
XAAL13 
A Build replication vector to insert line breaks: 
AAL14: AAR€CEAAVRI 01 
A Allow 8 positions for char, newline, 
AARTAAINDI]€8 
A Expand visual representation: 
AAVR©AAR/AAVR 
A Adjust indices for new VR: 
AAINDCAAIND[ AAAINDI]+7XCLOAAIND)-OLIO 
A Insert break characters: 
AAVRIAAINDo. +(~010)+171¢( (eAAIND) , 7) CAATCNL, 60! 
A Redefine NL: 
AANL€( AAVR=AATCNL)/LOAAVR 
A Branch if no left margin: 
AAL15: %AALEFTIAAL16 
A Build replication vector to insert left margin: 
AAR¢(COAAVRIe1 
A Allow positions for margin: 
AARL “1LAANLI©AALEFT+1 
A Expand visual representation: 
AAVR€AAMARGIN, AAR/AAVR 
Aa Adjust newline indices for new VR: 
AANL©AANL+AALEFTX CLP AANL) +~0I0 
Aa Insert left margin: 
AAVRE CT 1IAANLIJ°.+(~0IO)J+LTAALEFTI€!' ' 
a Branch if fn won't fit on current page: 
AAL16: >(AAHETGHT<1+AALINES+pAANL) /AAL18 


6 spaces: 
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V 
C330] 
C331] 
[332] 
[3334 
[334] 
[335] 
[336] 
[337] 
[338] 
[339.1 
[340] 
[341] 
[342] 


[343] 
[344] 
[345] 


[346] 
[347] 
[348] 
C349] 
[350] 
[351] 
[352] 
[353] 
[354] 
C355] 
[356] 
C357] 
C358] 
[359] 
C360] 
C361] 
C362] 
C363] 
[364] 
[365] 
[366] 
[367] 
[368] 
[369] 
[370] 
C371] 
C372] 
[373] 
[374] 
[375] 
[376] 
[377] 

Vv 
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wspoc (continued) 
AAL17: AATXTCAATXT , AATCNL, AAVR 
AALINES©AALINES+1+9AANL 
AALAST©AANAME 
A 
a Incl. 2 blank lines after each fn (1 in VR 
A already): 
AAN€1L AAHETGHT-AALINES 
AATXTCAATXT, AANOAATCNL 
AALINES©CAALINES+AAN 
Aa Get next fn: 
2AALI12 
A Prepare footnote: 
AAL18 : AAFOOTCAAFIRST, ( ( (OAAFIRST) #OAALAST) VAAFIRSTY. #( 
OAAFIRSTITAALAST)/' > ' ,AALAST 
AAFOOT€ (Xp AALAST) / C-AAWIDTH) TAAFOOT 
A Display page: 
O<AATXT, ( CAAHEIGHT-AALINES ) pPAATCNL) , AAFOOT, AABOTTOMo 
AATCNIL 
a Exit if no functions left: 
>AADONE/ 0 
A Format new page: 
AAFIRST¢AANAME 
AATXT¢CAATITLE, AATCNL 
AAPNOCAAPNO+1 
AAT¢’ PAGE ’° ,SAAPNO 
AATXTLE (-pAAT) TLPAATITLEICAAT 
AALINES€2 
A Branch if fn will fit on the new page (with 
A footnote): 
> (AAHETGHT21+AALINES+eAANL) /AAL17 
A Flag newlines which end entire (not broken) 
a fn lines: 
AABCCAAVRE C1L+AALEFT) +7 1LAANLI='(€'),1 
A Compute no. of newlines to take for current pg: 
AAN€+/V\OCAAHEIGHT-2+AALINES ) pAAB 
AALINES©AALINES+AAN+1 
AATCAANLAANL 
A Compute no. of chars to take for current page: 
AAN€(~0I0)+AANLEAAN-~OIO] 
AANL€AAT-AAN 
A Include these chars on page and squeeze from VR: 


AATXT©AATXT, AATCNL, AANOAAVR 

AAVR¢EAANLAAVR 

AALAST¢AANAME 

A Include fn name at top of remainder of VR: 
AAT¢CAALEFTp!’ '),' V ',AANAME,’ CCONT.)’ ,AATCNL 
AAVR©AAT, AAVR 

AANL€ CAATLAATCNL) , AANL+ AAT 

A Branch to display page: 


2AAL18 
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[WSID: USEDBY] 

V USEDBY AAFNS;0103; AAALP; AAAN; AAARGS 3; AAB3; AABLL3 AAC; AACOLS 
> AAGDSP; AAGLOBAL; AAGNUM 3; AAHDR3; AAT 3; AAIDLEN 3; AAIDS; 
AAIDSTART 3; AAIDTYPES 3; AAIND; AAINDENT 3; AAINDEX 3 AAJ 3; AALAB; 
AALCGLOBAL; AALCPARSED; AALDSP3; AALEN ; AALIM; AALNUM; AALOC; 
AALOCAL;: AALTYPE :; AAN 3; AANAME ; AANAMES 3 AANC 3; AANCCON 3; AANCMT 
sAANID; AANL; AANQ 3 AANUM3; AAPAN 3s AAPARSE 3; AAPARSED3; AAPBL; 
AAPID; AAPLAB; AAPNUM;3 AAPSID3 AAR; AARESULT 3 AARVAR; AAS; 
AASTART 3; AAT 3 AATCNL; AAVR 


A Displays chart of fns and vars called by the fns 
A specified in the character matrix or vector 

Aa (delimited by spaces) argument. Requires subfns: 
A CMIOTA; Cand CRAVR if not APL*PLUS or SHARP APL). 
A Use origin 1: 

OTO¢+1 

Aa Indent per level: 

AAINDENT¢€3 

a Branch unless fns a matrix: 

>(2#eppAAFNS )OCAALI 

A Left justify char mat: 

AAFNS€(+/A\AAFNS=' 'JOAAFNS 

PAAL2 


A Flag blank before and last char of each name: 
AAL1: AAFNS€’ ',AAFNS 
AABCAAFNS=!' ' 
AAT€ (AAB#10AAB)/LOAAB 
AAT€( CC 9AAT)+2),2) 0AAT 
A Lengths and starts of each name: 
AALEN€-/®AAT 
AASTART€AATL 31] 
a Number of rows and cols in desired matrix: 
AAR ep AALEN 
AAC¢O! [ /AALEN 
A Blank, raveled array: 
AATEC CAARXAAC) oe! ' 
A Compute indices: (LLEN[1]),(tLEN[2]),... 
AAI€ AALEN/~-~110,+\AALEN 
AAICAAT+L PC AAI 
A Insert fn names into raveled matrix: 
AATL[AAI+AALEN/AACX~1+lLOAALENI©CAAFNS[ AATI+AALEN/AASTART]I 
A Reshape to matrix: 
AAFNS€(C AAR, AAC) AAT 
A Exit if no fns specified: 
AAL2: AAR€1eEPAAFNS 
>AARLO 
Aa Construct newline character: 
AATCNL¢UTCNL A APL*PLUS 
AATCNL¢€OTCL1i+0I0O] A APL2 
AATCNL¢OAV[L156+0I0] A SHARP APL 
Initialize tracking variables... 
Character matrix of all distinct identifiers 
found so far: 
AANAMES¢(CAAFNS CMIOTA AAFNS)=LAAR) #AAFNS 


DDDDD 
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V USEDBY (continued) 
[45] a Index (into NAMES) vector of fns analyzed 
[46] a so far (°1=initial call): 
[47] AAPARSED¢e, 1 
[48] aA Index Cinto NAMES) vector of global names for 
[49] aA fn in PARSED: 
[50] AAGLOBAL©CAANAMES CMIOTA AAFNS 
[51] a No. of globals in GLOBAL for fn in PARSED: 
[52] AAGNUM¢AAR 
[53] A No. of globals in GLOBAL before those for fn 
[54] A in PARSED: 
[55] AAGDSP<,0 
[56] A Index (into NAMES) vector of local names for 
[571 A fn in PARSED: 
[58] AALOCAL¢€L0 
[59] A No. of locals in LOCAL for fn in PARSED: 
[60] AALNUM¢,0 
[61] A No. of locals in LOCAL before those for fn 
C62] A in PARSED: 
C63] AALDSP¢,0 
[64] A Integer vector of local var types for each elt 
[65] a of LOCAL (1l:label; 2:result; 3:argument; 4:local): 
[66] AALTYPE€tO 
[67] A Index into PARSED of object whose globals are 
[68] A currently being evaluated: 
[69] AALCPARSED¢€,1 
[70] A Index into partition of GLOBAL (for the object 
[71] A whose globals are currently being evaluated) of 
[72] a the object being evaluated: 
C73] AALCGLOBALE,1 
[74] a 
[75] a Determine index into NAMES of object being evaluated: 
[76] AALOOP: AAINDEX©AAGLOBALL AAGDSP[ AALCPARSED[1]]+ 
AALCGLOBAL([11]] 
[771 A What's its name? 
[78] AANAMECAANAMES[ AAINDEX; ] 
C79] AANAME€CAANAME?#’ ')/AANAME 
[80] A Loop on elts of LCPARSED from local to global 
[81] A (1I=2,3,...) Cto check whether object is locally 
[82] Aa defined at higher level): 
C83] AAT€¢1 
[84] AALIM¢71+e9AALCPARSED 
[85] AALP1: ®7CAALIM<AAT¢AATI+1) 9pAAL3 
C86] AAIND€AALCPARSED[ AAT] 
[87] A List of local objects at this level: 
[881 AATCAALOCAL[LAALDSP[ AAINDJ+LAALNUM[AAIND]] 
[891 A Search for this object: 
[90] AAJ*¢AATLAAINDEX 
[91] A Move up a level if not found: 
[92] 7*CAAJ>eAAT) PAALPI 
[93] A If found, determine type of local: 
[94] AATCAALTYPELAALDSPLAAIND]+AAd ] 
[95] A Display object name, fn where local, type: 
[96] AAR¢AANAMES[ AAPARSED[ AAIND]; ] 
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V 
[97] 
[98] 
[99] 
C100] 


C101] 
C102] 
C103] 
C104] 
[105] 
C106] 
C107] 
[108] 
C109] 
[110] 
C111] 
C112] 
C113] 
[114] 
C115] 
C116] 
C117] 
C118] 
C119] 
C120] 
C121] 
C122] 
C123] 
[124] 
C125] 
£126] 
(127) 
C128] 
C129] 
[130] 
C131] 
C132] 
[133] 
£134] 
C135] 
C136] 
C137] 
C138] 
C139] 
C140] 
[141] 
[142] 
C143] 
[144] 
[145] 
C146] 
[147] 
[148] 
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USEDBY (continued) 


AAR€ CAAR#! ')/AAR 
AAT€(4 8 oe'’label result argumentliocal ‘LAAT; ] 
AAT CAAT#! ')/AAT 
O¢( C(AAINDENTX~1+ oAALCPARSED)e’ '),AANAME,’ (',AAR,!’ - 
ChAT St a 
>AALI10 
A 
a Branch if object is a function: 
AAL3:3°(3=O0NC AANAME) pAAL4 
A Display object name and global indicator: 
D+ ( CAAINDENTx~1+eAALCPARSED)Je’ '),AANAME,’ (global)! 
>AAL10 
A 
A Display object name: 
AAL4:0¢( (AAINDENTX~1+pAALCPARSED)o'’ '),AANAME 


A Branch if fn has already been parsed: 
>( CoOAAPARSED) ZAAJ©AAPARSEDLAAINDEX) pAALY 
AAPARSED¢AAPARSED, AAINDEX 

A 

A Analyze fn for global, local identifiers... 

Aa Construct visual representation: 
AAVR¢OVR AANAME aA APL*PLUS 

AR AAVR€1 OFD AANAME A SHARP APL 

A AAVR¢CRAVR OCR AANAME aA Other APL systems 

A Update selected vars: 

AAGDSP¢AAGDSP, pAAGLOBAL 
AALDSP¢AALDSP, pAALOCAL 

A Branch unless fn locked: 
+( Xp AAVR) ePAALS 

A No identifiers found if locked: 
AAGNUM¢AAGNUM, 0 
AALNUMCAALNUM,0O 
XAALI 

A Use origin O for fn parsing: 

AALS: DIO€0 

A Where does header end? 
AAT¢AAVRLAATCNL 

A Grab header of fn (less newline): 
AAHDR¢AATPAAVR 

A Drop header from vis rep: 
AAVREAATLAAVR 

A Where does fn syntax end? 
AAT¢AAHDRU’ ; ’ 

A Localized vars: 

AALOC¢CAATIAAHDR 

A Drop local vars: 
AAHDR©AATeAAHDR 

A Drop leading junk: 
AAHDR¢CV\~AAHDRE’ V')/AAHDR 

A Is there an explicit result? 
AAT¢( pPAAHDR) >AAINDCAAHDRU' €! 

A Explicit result (if any): 
AARESULT¢ (AATXAAIND) PAAHDR 
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USEDBY (continued) 
A Remove result from header: 
AAHDR€'’ ',((X9AARESULT) +9 AARESULT) 4 AAHDR 
a Starting indices of fn name, args: 
AASTART€1+CAAHDR=' ')/LEAAHDR 
A Lengths of names: 
AALEN¢€ C1LAASTART, 1+O9AAHDR)-1+AASTART 
A No. of args: 
AAT¢€ 1+ pAALEN 
A Indices of args (if any): 
AANCpAAIND€( (AAT=2)00), CXAAT) CAAT 
a Don't consider fn name: 
AASTART¢AASTARTLEAAIND] 
AALENCAALENL[ AAIND] 
A Length of longest arg: 
AACOLS€1 /0,AALEN 
A Raveled, blank mat of args: 
AAARGS€(AANXAACOLS) 0!’ ' 
A Compute indices: (LLEN[1]),(.LEN[2]),... 
AAI©CAALEN/-"~110,+\AALEN 
AAI€AAT+L9 AAI 
a Insert names into raveled matrix: 
AAARGS [ AAI+AALEN/ AACOLSXLAAN ]©AAHDR[I AAT+AALEN/ AASTART 
] 
A Reshape to mat of args: 
AAARGS€ CAAN , AACOLS ) pPAAARGS 
A Flag newline chars: 
AANLCAAVR=AATCNL 
A Flag nonquotes: 
AANQ©AAVR#!'!''! 
Map of chars not in quote pairs (i.e. char 
constants) within each fn line (i.e. an open 
quote is closed at the end of the fn line). 
Leading quotes are flagged 0; closing quotes are 
flagged 1 Cincluding Ist of double-quote pairs): 
AANCCONE= \AANQZAANL\AAT#Z 110, AATE~AANL/ =\AANQ 
AANQ€0 
Aa Flag non-a chars Cincludes as in quotes): 
AANCCAANCCONAAAVR='A' 
Aa Flag newlines or as (except as in quotes): 
AASCAANLZAANC 
A Map of chars which do not follow a a Cignoring as 
A within quotes) within each fn line. as are 
A flagged 0: 
AANCMT€~#\AAS\ AATF 110, AATE~AAS/AANC 
AAS©AANCEO 
A Map of chars which are not included within as or '': 
AAPARSE©AANCMTAAANCCON 
AANCCON¢AANCMT€0 
Aa Flag digits and letters: 
AANUM¢AAPARSEAAAVRE ! 0123456789" 
AAALP¢AAPARSEAAAVRE ' ABCDEFGHIJ KLMNOPORSTUVWXY ZAabcdef 
ghijklmnopqrstuvwxyza'’ 
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V USEDBY (continued) 
[199] a Flag blanks: 
[200] AABL€AAPARSEAAAVR=!' !' 
[201] AAPARSE€Q 
[202] a Flag alphanumeric chars: 
C203] AAAN€¢AANUMVAAALP 
[204] A Pole vec of contiguous digits: 
[205] AAPNUMCAANUM# 110, AANUM 
[206] AANUM¢O 
[207] a Pole vec of contiguous digits/letters: 
[208] AAPAN¢CAAAN# 110, AAAN 
[209] AAAN€O 
[210] a Pole vec of identifiers: 
[211] AAPID¢AAPAN\AATV” 1®AATCAAPAN/ AAALP 
C212] AAALPCAAPAN€O 
[213] a Flag 'O’ before identifiers (Onames): 
[214] AATeAAT\'D! =CAATECIOAAPID) /AAVR 
[215] a Shift leading poles of Onames to include QO: 
[216] AAPID¢AATVAAPID> LAAT 
[217] AAT€0O 
[218] a Flag char following ] after line no.: 
[219] AASTART¢” 19AAPNUM\ ~1®AAPNUM/ “20AANL 
C220} AANL©AAPNUME€O 
[221] A Pole vec of contiguous blanks: 
[222] AAPBL€AABL? 110, AABL 
C223] AABL€O 
[224] aA Flag 1st nonblank char in each line: 
[225] AASTART¢ CAASTART>AAPBL) VAAPBL\” 10AAPBL/AASTART 
[226] AAPBL¢O 
[227] aA Pole vec of identifiers at start of line: 
[228] AAPSIDCAAPID\AATV 1LOAATCAAPID/AASTART 
[229] A Pole vec of labels: 
[230] AASTART¢O 
[231] AAPLABCAAPSID\AATVIOAATE! : '=AAPSID/AAVR 
[232] AAPSID¢€0 
[233] a Start and end (+1) indices of identifiers: 
C234] AAINDCAAPID/LPAAPID 
[235] A No. of identifiers: 
[236] AANID¢(pAAIND) +2 
[237] AAIND¢CAANID,2) eAAIND 
[238] A Start indices of identifiers: 
[239] AAIDSTART¢AAINDLE 50] 
[240] A Lengths of identifiers: 
C241] AAIDLENCAAINDLE 35 13-AAIDSTART 
[242] a Starting indices of local vars: 
[243] AASTART€1+CAALOC=' 3')/LPAALOC 
[244] A Lengths of local vars: 
[245] AALEN¢C1LAASTART, 1+o0AALOC)-1+AASTART 
[246] A Length of longest ident.: 
[247] AACOLS¢CT /AALEN)T Cf /AATDLEN) SF Ce AARESULT) [1l 09AAARGS 
[248] A Pad arg names to conform: 
[249] AAARGS«((1peAAARGS) , AACOLS ) TAAARGS 
[250] A 0 row mat if no result: 
[251] AARESULT¢( (XpAARESULT) , AACOLS ) pPAACOLSTAARESULT 
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USEDBY (continued) 
A Raveled blank mat of local vars: 
AAT€ (CAACOLSXAAN€PAASTART) 0’ ' 
A Compute indices: (LLEN[1]),(LLEN[2]),... 
AAIT©AALEN/-~110,+\AALEN 
AAI€AAI+LGAAT 
A Insert names into raveled matrix: 
AATT AAI+AALEN/ AACOLSXLAANI©CAALOC[ AAT+AALEN/AASTART J] 
A Reshape to mat of local vars: 
AALOC€(AAN, AACOLS) pAAT 
A Raveled blank mat of identifiers: 
AAT€ CAACOLSXAAN¢OCAAIDSTARTIJ 0’ ' 
A Compute indices: (LIDLENI1]),(LIDLEN[2]),... 
AATCAAIDLEN/-~110,+\AAIDLEN 
AATf©AAT+L OG AAI 
a Insert names into raveled matrix: 
AATLAAT+AAIDLEN/ AACOLSXLAAN JEAAVRI AAT+AAIDLEN/ 
AAIDSTART ] 
A Reshape to mat of identifiers: 
AAIDS€(AAN, AACOLS ) pAAT 
A Mat of label names: 
AALABf€ CAAS€(C CAAN, 2) 0 AAPID/AAPLAB)L 301) 4AAIDS 
AAPID©AAPLAB¢eO 
A Mat of referenced vars less labels: 
AARVAR€¢ (~AAS) AAAIDS 
A Combine different types of vars and a vector 
A of their types: 
AAIDS€CAALAB, LOJAARESULT, COJAAARGS , [LOJAALOC) ,[0] 
AARVAR 
AAIDTYPES¢( C1OPAALAB) , Clee AARESULT) , ClepAAARGS) , (leo 
AALOC),1eeAARVAR)/ 123 45 
A Select just the first distinct name (Cand type): 
AATECCAAIDS CMIOTA AAIDS)=LeAAIDTYPES ) /LPAAIDTYPES 
AAIDSCAAIDSLAAT; ] 
AAIDTYPES¢AAIDTYPESL AAT] 
A Branch if no columns in IDS Ci.e. 
+(1LeAAIDS)LAAL5SA 
A Retain O-names only for system variables (add other 
A system vars on your APL system): 
AAS€ 5 4 o'OIO OPP ORL OCT OLX ' 
AAT€'O' =AATDSE 3:0] 
AAI©CAAT/LOAAT 
AAR€ CIP PAAS) SAAS CMIOTAC COAAT) ,4) TAAIDSC AAT; I 
A Branch if all O-names are in this list: 
>(V/AAR)JAALSA 
AAT¢~AAT\AAR 
A Squeeze out O-names which are system fns (e.g. 
AAIDS€AATAAAIDS 
AAIDTYPES¢AAT/AAIDTYPES 
A Return to origin 1: 
AALSA: 010¢€1 
Aa Insert new names into NAMES and convert to indices: 
>Cx CAAS€1LLOAANAMES ) -AAT€11 CAATDS)IOAALSB , AALE , AALZ 


no identifiers): 


OEX): 


=o. = 


Chapter 13 Solutions 


[326] 
C327] 
C328] 
C329] 
[330] 
C331] 
C332] 
C333] 
C334] 
C335] 
C336] 
C337] 
[338] 
[339] 
[340] 

V 


USEDBY (continued) 
A NAMES has more columns than IDS: 
AAL6: AAIDS€C ClO PAAIDS) ,AASJTAAIDS 
2AAL8 
A IDS has more columns than NAMES: 
AAL7: AANAMES¢€C€ (loge AANAMES ) , AAT) TAANAMES 
Q 
AAL8: AAIND©CAANAMES CMIOTA AAIDS 
A Flag those not found: 
AATCAAIND>AAR¢lLOE PAANAMES 
A Add to NAMES and compute indices: 
AANAMESCAANAMES , LIJAATZAAIDS 
AASCAAT/LOAAT 
AAINDE AAS ]©€AAR+L AAS 
AAIDS¢AAIND 
A How many identifiers are locals? 
AAN€  1+AAIDTYPESUS5 
AALNUM¢AALNUM, AAN 
AALOCAL€AALOCAL, AANpAAIDS 
AALTYPECAALTYPE , AANOPAAIDTYPES 
A How many identifiers are globals: 
AAIDS©CAANLAATDS 
AAGNUM¢AAGNUM, eAATDS 
AAGLOBAL€¢AAGLOBAL, AAIDS 
A 
A Branch unless this object has globals: 
AALI:>CXAAGNUME AAJ 1) 1AAL10 
A Add a level to the "state indicator": 
AALCPARSED¢AAJ , AALCPARSED 
AALCGLOBAL¢1 , AALCGLOBAL 
>AALOOP 
A 
A Increment "state indicator" line: 
AAL10: AAT¢AALCGLOBAL[E 1]€1+AALCGLOBAL[ 1] 
A Resume if more globals at this level: 
> (CAATSAAGNUME AALCPARSED[1J]])  eAALOOP 


A Else drop a level from the "state indicator": 


AALCPARSED¢1/AALCPARSED 
AALCGLOBAL€1/) AALCGLOBAL 

A Continue if any levels left: 
+( Xe AALCPARSED) pAAL1O 
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FILE DESIGN AND UTILITIES 


1. Here is a possible file structure for the "functions file'": 


FILE NAME: FNS 


TIE NUMBER: up to you 


DESCRIPTION: Contains representations of functions 


COMP. 
NO. 


1+1 


VARIABLE 


a_i 


C1] 
C2] 
[3] 
C4] 
[5] 
C6] 
C7] 
C8] 
[9] 


VR 


DESCRIPTION 

Character matrix directory of the names Cone 
per row) of the functions whose representations 
are stored on file. The matrix has as many 
columns as the longest function name has 
elements. The shorter names are left justified 
(padded to the right with spaces). 


Character vector visual representation of the 
function whose name is DIR(I;] Cor canonical 
representation if your APL implementation of 
APL does not support visual representations). 


CWSTD: FNSFILE] 


V FNAME FNCREATE TIE;DIR;T 

a Inititlizes an empty functions file named FNAME, 

a tied to TIE. 

FNAME OFCREATE TIE A APL*PLUS 

A FNAME UCREATE TIE aA SHARP APL 

A Construct empty directory: 

DIR< 0 0 oe!’’ 

A Append as ist component: 

T¢DIR OFAPPEND TIE A APL*PLUS if OFAPPEND has result 

A DIR OFAPPEND TIE aA APL*PLUS if OFAPPEND has no result 
C10] A DIR OAPPEND TIE A SHARP APL 


V 
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CWSID: FNSFILE]J 
V AAR¢CAATIE PUTFN AANAME;010;AADIR; AAIND; AAT$ AAVR 
Appends or replaces to the functions file tied 
to AATIE the visual representation (Cor canonical 
representation) of the function named AANAME. 
Note: will not file functions whose names are 
local to PUTFN. Returns AANAME if successful, 
else empty vector. 
Origin 1: 
C8] OL0¢1 
[9] a Construct visual representation: 
[10] AAVRCUVR AANAME aA APL*PLUS 
[11] A AAVR¢e1 OFD AANAME A SHARP APL 
[12] A AAVR¢OCR AANAME aA On other systems 
[13] A Return empty vector if function locked: 
[14] AAR¢’! 
[15] %(X/pAAVR)LO 
[16] A Read directory of function names: 
[17] AADIR¢OFREAD AATIE,1 A APL*PLUS 
[18] A AADIR¢OREAD AATIE,1 A SHARP APL 
[19] A Delete any blanks in name: 
[20] AAR€AANAME€CAANAME#’ ')/AANAME 
[21] A Search directory for function name (branch if 
[22] A not found): 
[23] »CC1leAADIR) <p AANAME) 0AAL1 
[24] AAIND¢€CAADIRA. =C1leAADIR) TAANAME) 11 
[25] %C(AAIND>1eeAADIR) PAALI 
[26] A Replace existing visual representation with 
[27] A new one: 
[28] AAVR OFREPLACE AATIE,1+AAIND A APL*PLUS 
[29] A AAVR OREPLACE AATIE,1+AAIND A SHARP APL 
C30] 0 
[31] A Add name to directory. Branch unless AANAME 
[32] A too long: 
[33] AAL1:3( CeAANAME) <11 0AADIR) CAAL2 
[34] A Pad columns in directory to length of AANAME: 
[35] AADIR€((1ppAADIR) , pAANAME) TAADIR 
C36] AAL2: AADIR€AADIR, £1] 0C1L0AADIR) TAANAME 
C37] A Replace directory: 
[38] AADIR OFREPLACE AATIE,1 A APL*PLUS 
[39] A AADIR OREPLACE AATIE,1 A SHARP APL 
[40] A Append function representation: 
[41] AAT¢AAVR OFAPPEND AATIE aA APL*PLUS if OFAPPEND has 


> 
| 
DDDDODODDOD 


result 

[42] A AAVR OFAPPEND AATIE A APL*PLUS if OFAPPEND has no 
result 

[43] A AAVR OAPPEND AATIE aA SHARP APL 


V 
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[WSID: FNSFILE] 

V AARCAATIE GETFN AANAME;0I03;AADIR3;AAIND; AAVR 
Reads from the functions file tied to AATIE the 
visual representation (or canonical representation) 
of the function named AANAME and defines that 
function in the active workspace. Note: will not 
define functions whose names are local to GETFN. 
Returns AANAME if successful, empty vector if 
function not found, numeric code if function not 
definable. 
Origin 1: 
C10] U10¢1 
[11] A Read directory of function names: 
C12] AADIR¢OUFREAD AATIE,1 A APL*PLUS 
£13] A AADIR¢COREAD AATIE,1 A SHARP API 
[14] A Delete any blanks in name: 
C15] AANAME¢CAANAME#’ ’')/AANAME 
[16] aA Search directory for function name Cexit if not 
[17] a found): 
[18] AARe’'' 
[19] »CC1LeEAADIR) <p AANAME) 00 
[20] AAIND¢€CAADIRA.=C1ILoAADIR) TAANAME) 11 
[21] 7*CAAIND>1eePAADIR) CO 
[22] A Read visual (or canonical) representation from 
[23] a file: 
[24] AAVR¢COFREAD AATIE,1+AAIND A APL*PLUS 
[25] A AAVREOREAD AATIE,1+AAIND A SHARP APL 
[26] A Define function in workspace (result is function 
[271 A name or numeric code indicating WS FULL, SYMBOL 
[28] A TABLE FULL,...): 
[29] AAR*CODEF AAVR A APLAPLUS 
C30] A AAR€3 OFD AAVR A SHARP APL 
[31] A AARCOFX AAVR A On other systems 

V 


o 
Ol 
4 
D>DDDIDOVOD OV 


C[WSID: FNSFILE] 
V R¢TIE DROPFN NAME;010;DIR; IND; LAST 
Removes from the functions file tied to TIE 
the visual representation (or canonical 
representation) of the function named NAME. 
Returns NAME if successful, empty vector if 
function not found. 
Origin 1: 
C7] OL0¢1 
[8] aA Read directory of function names: 
[9] DIR¢UFREAD TIE,1 A APL*PLUS 
[10] A DIR¢OREAD TIE,1 A SHARP APL 
[11] A Delete any blanks in name: 
[12] NAME¢CNAME#’ ')/NAME 
[13] A Search directory for fn name (exit if not found): 
[14] Re’! 
[15] %(C1leDIR)<pNAME) 00 
L16] IND¢C(DIRA.=(C1iloDIR)TNAME) 1 


mn 
i> 
id 
DDDD DD 
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C17] 
18] 
[19] 
C20] 
C21] 
[22] 
C23] 
[24] 
[25] 
[26] 
E2c3 
[28] 
[29] 


C9] 

C10] 
[11] 
C12] 
C13] 
C14] 
[15] 
C16] 
C17] 
[18] 
[19] 
[20] 
C21] 
C22] 
[23] 
[24] 
[25] 
[26] 
[27] 
[28] 
[29] 


V DROPFN (continued) 


+(IND>1lpeDIR) 0 
A Replace this name with last one in directory: 
LAST¢(eDIR)[1] 
DIR(C IND; J€DIREI LAST; ] 
DIR¢ ~1 0 JDIR 
A Replace this vis. rep. with last one on file: 
COFREAD TIE,1+LAST)OFREPLACE TIE,1+IND A APL*PLUS 
OFDROP TIE, 1 
A C(COREAD TIE,1+LASTJIOREPLACE TIE,1+IND A SHARP APL 
A ODROP TIE, 1 
A Replace directory: 
DIR UFREPLACE TIE,1 A APL*PLUS 
A DIR UREPLACE TIE,1 A SHARP APL 
V 


CWSID: FILEDOC] 

V FILE FILEDOC PAGE;01I10;UPP; BOTTOM; CMPS;D;DATA; DONE; 
FIRST; FOOT; HEIGHT ;1; LAST; LEFT; LIM; LINES ;MARGIN;N;NEW; 
NONDISPLAY ;OLDCMP;PNO;QUOTESS;START;T;TCNL;TITLE;TOP; 
TXT; W;WIDTH 

Displays paged file documentation. All output is 
via O¢ so replace all O¢ by custom fn (e.g. PRINT) 
to redirect output. PAGE: rows, columns, margins 
(top, bottom, left, right) FILE: tie no. if 
APL*PLUS or SHARP APL; otherwise a file name. 

Use origin 1: 
O1TO0¢+1 
A Format no.s to 10 digits (CLEAR WS default): 
OPP¢10 
A Activate file if IBM’s workspace 2 VAPLFILE: 
A USE FILE 

TOP¢PAGETL3 ] 

BOTTOM¢PAGE[ 4] 

HEIGHT*«PAGE(1]-TOP+BOTTOM 

LEFT¢PAGE([5] 

WIDTH¢PAGE[21-PAGE[6] 
A Construct newline character: 

TCNL¢OTCNL A APL*PLUS 
A TCNL¢OTC(C2] A APL2 
A TCNLCUAV([157] A SHARP APL 
A Format today’s date: 

D¢+sOTSC2 3 1] 

DI(D=' ')/teDI]e'/' 
A Format the time: 

T¢CSOTSl4]),':', 2T'O'’ ,s40TS[5] 
A Format the file name: 

TITLE¢, OFNAMESCOFNUMStUFILE;] A APL*PLUS 
A TITLE, ONAMESCONUMSLFILE;] A SHARP APL 
A TITLE¢FILE a Otherwise 


D>DDDOD ND 
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V FILEDOC (continued) 

[30] a Delete leading/trailing blanks: 

[31] TITLE€(-+/A\OTITLE=' 'JIC+/A\TITLE=' ‘')ITITLE 

[32] A Number of file components and first one: 

[33] S¢OFSIZE FILE a APL*PLUS 

[34] A S¢OSIZE FILE A SHARP APL 

[35] A S¢1,1+rHO FILE A 2 VAPLFILE 

[36] START¢S(1] 

[37] LIM¢sS{2] 

[38] N€¢LIM-START 

[39] A If 2 VAPLFILE: 

[40] A CMPS©EXITtiS¢RHo FILE 

[C41] Aa LIM¢N¢eCMPS©€CMPS/teCMPS 

[42] aA Exit if none: 

[43] 2NLO 

[44] A Format page title: 

[45] TITLE¢TITLE,'’ (€’,C€6N),'’ COMPONENT’ ,CC1#4N)0'S’),') * ', 
D,’ gare 

[46] A If 2 VAPLFILE: 

C47] A TITLECTITLE,’ (€',CEN),’ COMPONENT’ ,CC14AN)0'S'),’ OF ' 
sCOeSd a’) *& De 9 sT 

[48] TITLE<¢CTOPeTCNL) ,WIDTHTCLEFT»’ '),’FILE: ' ,TITLE 

[49] A Insert page number: 

[50] PNOel1 

C51] T¢’PAGE 1’ 

[52] TITLEL(-pT)TLpTITLE]¢T 

[53] A Build first page: 

[54] TXT¢TITLE,TCNL 

C55] aA Keep track of lines used so far (below top 

[56] a margin) in TXT: 

C57] LINES¢2 

[58] QUOTE¢’!''!' 

C59] A Define characters which don’t display normally: 

[60] NONDISPLAY¢OTCNL,OTCLF,OTCBS ,OTCBEL, UTCDEL,UTCNUL, 
OTCESC,OTCFF A APL*PLUS 

[61] A NONDISPLAY¢COAVE1 2 157 159 160] A SHARP APL 

C62] A NONDISPLAY¢OTC aA APL2 

[63] MARGINCLEFTo’ ' 

[64] A Include heading: 

C65] T¢TCNL,MARGIN, ’ COMPONENT SHAPE oe VALUE’ ,TCNL 

L6G) sEXTSTXT: Te MARGIN @3333255> @ Se See : 

[67] LINES¢LINES+2 

[68] A Loop by component: 

[69] OLDCMP<O 

C70] FIRST¢START 

C71] IZ¢START+ 1 

[72] A If 2 VAPLFILE: 

[73] A FIRST¢CMPS[1] 

[74] A TI€o 

[75] LOOP: >(DONECLIMsIT¢1I+1)/L2 

[76] A LOOP: »(DONECLIM<I¢1I+1)/L2 A 2 VAPLFILE 

[77] aA Read data from file: 

[781 NEW¢OFREAD FILE,I A APL*PLUS 

[79] A NEW¢OREAD FILE,I A SHARP APL 
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V FILEDOC (continued) 
C80] A NEW¢GET CMPS[I] A 2 VAPLFILE 
[81] A 
[82] a Branch unless this is the first component: 
C83] »CxOLDCMP)/L1 
[84] a Last distinct object read from file: 
C85] DATA<NEW 
[86] A Earliest such component of that object: 
[87] OLDCMP¢TI 
[88] A OLDCMP¢CMPSCILLIM] A 2 VAPLFILE 
[89] >LOOP 
[90] A Get next object if this one is identical (to 
[91] A last one): 
[921] L1:>(CCeeDATA) #eepNEW) /L2 
[93] A If 2 VAPLFILE: 
[94] A L1:>29C(CMPS[I]#1+CMPSEI+711)/L2 
[951 A >CCoeDATA) #e9NEW) /L2 
[96] »C(CeDATA)V. #eNEW) /L2 
[97] 2%CA/,DATA=NEW) /LOOP 
[98] A Format DATA (Ccomponents OLDCMP to I-1)... 
[99] A Format shape: 
[100] L2:S€(3eDATA),’ po ' 
[101] A Omit shape and 0 if a scalar; pad to line up 
[102] A with o's: 
[103] S€(-11IeS)T(3<eS)/S 
[104] A Combine component no.(s) and shape; compute 
[105] A remaining width: 
C106] T¢OLDCMPZI+71 
[107] A T¢OLDCMP4CMPS[I+7~1] A 2 VAPLFILE 
[108] T¢CS6OLDCMP) ,T/'-'’ ,aI+ 71 
[109] A T€CSOLDCMP) ,T/'-' ,SsCMPS[I+7~1] A 2 VAPLFILE 
[110] TeC9TeTITC-S5feTITT 
C111] S¢MARGIN,T,'’ ¢€’,S 
C112] WewWIDTH-oS 
[113] A Branch if a numeric variable: 
[114] >(0=1TOeDATA) peL3 
[115] A Else data is char; consider only up to W chars: 
C116] DATA¢CWLX/oDATA) oDATA 
[1171] A Replace nondisplayable chars by @: 
[118] DATAL CDATAENONDISPLAY)/teDATAI<' BH’ 
[119] A Double up quote chars: 
[120] DATA¢QUOTE, (€1+DATA=QUOTE)/DATA) , QUOTE 
[121] A Branch if data will fit on a single line: 
[122] »(W2eDATA)/1L4 
[123] a Else truncate and show ’'...': 
[124] DATA¢C(W+ 3) e0DATA),’...!’ 
[125] ~»L4 
C126] A If data is numeric, consider only up to 
[127] A 1+W+2 elements: 
C128] L3:DATA<((C1+FW+2)Lx/eDATA) oDATA 
[129] DATA€¢sDATA 
[130] A Format value from empty array as 10: 
[131] DATA€DATA, (O=pDATA)/'10! 
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V FILEDOC Ccontinued) 

[132] A Branch if data will fit on a single line: 
[133] >(W2eDATA)/L4 
[134] A Else trunc. at last space within wid and show ’...’: 
[135] DATA€(C+/V\' '=6(W+7-3)eDATA) oDATA),’...' 
[136] A 
C137] A Append variable definition to page: 
[138] L4:TXT¢TXT,TCNL,S,DATA 
[139] LINES¢LINES+1 
[140] DATA¢NEW 
C141] OLDCMP¢I 
[142] A OLDCMP©CMPS[I] A 2 VAPLFILE 
[143] LAST¢I+71 
[144] A LAST¢CMPS[I+°~1] A 2 VAPLFILE 
[145] a Branch for more unless bottom of page (2 lines 
[146] a for footnote) or end of file: 
[147] ~»(C(DONE<LINES<HETIGHT+ 2)/LOOP 
C148] A Construct footnote: 
[149] L5:T¢FIRSTZLAST 
C150] FOOT¢(-WIDTH)T’COMPONENT’ ,(Toe’S'),' ',CéFIRST),T/' TO 

', LAST 
[151] A Display page: 
C152] O¢TXT, (CHEIGHT-LINES) oTCNL) , FOOT, BOTTOMoeTCNL 
[153] FIRST¢LAST¢I 
C154] A FIRST¢LAST¢CMPSCILLIM] A 2 VAPLFILE 
[155] A Exit if no more components: 
[156] ~-»>DONE/O 
[157] A Format new page: 
[158] TXT<TITLE,TCNL 
[159] PNO¢PNO+1 
[160] T¢'PAGE ',éPNO 
[161] TxXTCC-eTITLeTITLEI¢T 
[162] LINES€2 
[163] A Insert new heading: 
[164] T¢TCNL,MARGIN, '’COMPONENT SHAPE oe VALUE’ ,TCNL 
[165] TXT¢TXT,T,MARGIN,’--------- -----  ----- / 
[166] LINES¢LINES+2 
C167] A Branch for more components: 
[168] >LOOP 

V 
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V EMPLOYEES; F1;F2;F3;G;GOOD;IND;NUM;P;R 


2. Employee name 
3. Employee age 
Uses subfns: IOTA,CATRECWS , DELREC,SELECTWS. 
Ask for choice on same line: 
CHOOSE: 0¢P¢’ADD, DELETE, LIST OR END: ' 
R¢CoP)L0 
A Branch based on 1st char of response: 
»(C’'ADLE'=1TR)/ADD, DELETE, LIST,END 
O<e'’'*x* INVALID CHOICE. CHOOSE FROM: ADLE’ 
»* CHOOSE 
A 
A 
ADD: 0¢' EMPLOYEE NUMBER (COR O IF DONE)’ 
Fi¢,O0 
a Continue if exactly 1 number entered: 
+(1=eF1)/A1 
O<'’*k*k ENTER 1 NUMBER’ 
»ADD 
A Branch to choice question if O entered: 
A1:9(CO=F1)/CHOOSE 
a Continue unless employee number already exists: 
>("1=lo(FP,1)IOTA F1)peA2 
O«e'exe EMPLOYEE ',(COF1),’ 
»ADD 
A2:0¢P¢’EMPLOYEE NAME (MAX 25 CHARACTERS): ' 
A Ask for name at end of same line: 
F2¢(oP)10 
Aa Continue unless name too long: 
>(2520F2)/A3 
O¢e'*x* NAME TOO LONG’ 
»A2 
A3:0¢'EMPLOYEE AGE’ 
F3¢,0 
A Continue if exactly 1 number entered: 
>(1=0F3)/A4 
Oe+’*x* ENTER 1 NUMBER’ 
9A3 
A Continue if a valid age: 
A4:°9CCF3=[ F3)ACF3217)AF3<499)/A5 
O¢'**x AGE MUST BE INTEGER FROM 17 TO 99’ 
»A3 
A Pad name to length 25: 
A5: F2¢25TF2 
A Catenate new values and ask for more: 
FP¢FP CATRECWS 1 
2ADD 


A ILLUSTRATION OF FILE UTILITY FUNCTIONS. 

A Assumes employee information is on file. The file 
A is identified by the global variable FP (file 

A parameters). Fields of file are: 

A 1. Employee number 

A 

A 


DD 


ALREADY IN LIST’ 
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V EMPLOYEES (continued) 
[52] A 
[53] A 


[54] DELETE:0¢’ENTER EMPLOYEE NUMBERS TO DELETE’ 


[55] 
C56] 
[57] 
C58] 
[59] 
C60] 
C61] 
[62] 
[63] 
[64] 
C65] 
C66] 
[67] 
C68] 
C69] 
[70] 
C71] 
C72] 
Lou 
[74] 
C75] 
[76] 
C77] 
C78] 


C1] 
C2] 
[3] 
[4] 
C5] 
C6] 
C7] 
[8] 
[9] 
[10] 


C11] 
C12] 
C13] 
C14] 
[15] 
C16] 
[17] 


a Ravel to insure a vector, not scalar: 


NUM¢ , 0 


Aa Continue if all valid numbers: 


IND¢CFP,1)IOTA NUM 
+(CA/GOOD¢ 17 CeNUM) eIND)/D1 
O<'’*x*x NOT FOUND: ',6(0(~GOOD)/NUM 
»DELETE 


A Squeeze out deleted employees: 
D1: FP¢FP DELREC IND 


*CHOOSE 


LIST: 0¢'NUMBER AGE NAME’ 


je’? 


A Prepare to sort employees by number: 


'' SELECTWS FP, 1 2 3 
GeaAFl 


A Sort and display: 


O<+(€5 0 7 0 6F1I(G],(1.5]F30G]),CCCoF1),3)e’ '),F20G3] 
fe’! 
>CHOOSE 


C(WSID: MULTIFLO] 


V FP INITFILE FT;BLK;DEL;DISP;INCR;L;LAY;R;TIE;W;O01I0 
A Initializes file. Assumes the file already 
A exists, contains no components and is tied to FP[2]. 


OT0¢+1 


a Check validity of arguments: 


OERROR( (1#po9FP)V2#eoFT)/'RANK ERROR’ 

DERROR( (2#lopFT)V11#pFP)/'FP LENGTH ERROR’ 

INCR¢FP[3] 

OERROR( (INCR41loFT)VFPL8]#+/xFT[1:1)/'FT LENGTH ERROR’ 
OERROR(FPV.#!FP)/'FP DOMAIN ERROR’ 

OERROR( (A/((CFTL1;)>0)/FTL23;1)€t4)SCFT01;1]¥.<O)VFTL1:1V 
.#1FT021;1)/'FT DOMAIN ERROR’ 

TIE¢FP[2] 

OERROR(~TIE€OFNUMS)/'FILE NOT TIED’ 

DISP¢FP[4] 

NERROR(DISP<10)/'DISPLACEMENT ERROR’ 

BLK¢FP[5] 

OERROR( BLK<0)/'’BLOCK SIZE ERROR’ 

UERRORCFPL6 7 9 10]V.#0)/'USE O FOR FP[6 7 9 10]’ 
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V INITFILE (Ccontinued) 


[18] DEL¢FP[11] 

[19] »*CxDEL)JJL1 

[20] OERROR(~C!DELJIELINCR)/'NONEXISTENT DELETION FIELD 
NUMBER’ 

[21] OERROR(C1V.4FT0C;1DELI)/'USE FT=1 1 FOR DELETION FIELD’ 

[22] L1:L€euo 

[23] LAY¢FP[1] 

[24] »CxLAY)!JL2 

[25] OERROR(~LAYELINCR)/'NONEXISTENT LAYER FIELD NUMBER' 

[26] OERRORCO=WeFTLI1;LAYJ])/’'LAYER FIELD INACTIVE’ 

[27] OERROR(CLAY=!1DEL)/'LAYER AND DELETION FIELDS ARE THE 
SAME’ 

[28] Lecco, (W#1)eW)e0 

[29] »C24FTC2;LAYI)eL2 

C30] Lel0,CWALIOW) 0’ ’ 

[31] L2:R¢e’’ OFAPPEND TIE 

C32] R¢eC€O O e'')OFAPPEND TIE 

[33] R«CO 0 e’'’)JOFAPPEND TIE 

[34] R¢FT OFAPPEND TIE 

[35] R¢CrtO)JOFAPPEND TIE 

[36] R€Ct0O)JOFAPPEND TIE 

(371 R¢FP OFAPPEND TIE 

C38] R€C(tO)JOFAPPEND TIE 

C39] R€¢L OFAPPEND TIE 

[40] R¢eCtO)JOFAPPEND TIE 

[41] »CDISP<=10)e0 

C42] R¢€vt0 


[43] LOOP:»(DISP>R OFAPPEND TIE) eLOOP 


C1] 
C2] 
C3] 
C4] 
C5] 
C6] 
C7] 
C8] 
[9] 


CWSID: MULTIFLO] 


V NFP¢FP CATREC MAT;ARPS;BIT;BLK;C; CMP; DATA; DCMP; DEL; 


DISP;F;FLD;FREC;FT;FILL;GOOD;1;:INCR; INDS; LAYER; LC; LEAD 
sLV;sMsMINsN;sNFLD;NREC;RMAT;RPS;S:SDISP3;SETS;T;TIE;VEC; 
W;sWID;0I0 


A Catenates rows of MAT to file. 


DERROR( (2<poMAT)V1#opFP)/'RANK ERROR’ 
OERROR(11#oFP)/'LENGTH ERROR’ 
NFP¢FP 


A Convert scalar or vector to matrix: 


>(2=eoeMAT)eLl1 


MAT¢("2T 1 1 ,eMAT) oMAT 


A Exit if no records to catenate: 
L1:>(xX1loeMAT)L0 


[10] OfoO¢l 

C11] TIE¢FPC2] 

[12] INCR¢FP[3] 

[13] DISP¢FPL[4] 

C14] BLK¢FP(5] 

C15] DEL¢!FP£11] 
[16] FILLEFPL111<0 
[17] FT¢UFREAD TIE,4 
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VY CATREC (continued) 


a Widths (no. cols.) of fields: 
WID¢FT(1; ] 

a Indices of active fields: 
FLD¢«CxWID)/LINCR 

Aa Exclude deletion field: 
NFLD¢o FLD¢ (FLD4DEL)/FLD 
NERROR((1#x/oMATIACILoMAT) 4+/WIDECFLDJ])/'LENGTH ERROR' 
OERROR(C24FT02;FLD])V.40=1TOopMAT)/'DOMAIN ERROR’ 
A Extend singleton across all columns: 
>(1#x/oMAT) 0 L2 

MAT¢(1,+/WID{[ FLD] ) oMAT 

L2: ARPS¢RPS¢UFREAD TIE,8 

A Branch unless ARPS should be read: 
9C=/FPL7 10])0L3 

ARPS¢UFREAD TIE,10 

A Branch unless layered file: 
L3:39(xXFPL1J)JL7 

A Read layer values: 

LV¢OFREAD TIE,9 

a Compute the col inds of MAT with layer values: 
LC¢(0,+\WIDDCFLDJ)CFLDLUFPL11J+UWIDCFPL[1]] 
a Convert to scalar if vector fld: 
VEC¢1=eLC 

LC¢(VECLoeLC)JeLC 

A Use layer value of 1st row of MAT: 

L4: LAYER¢MAT[1;LC] 

a Branch if a vector layer field: 

>VECoL5 

Aa Flag rows of MAT in this layer: 
GOODeMAT( 5s LCIA.=LAYER 

A ...and sets with this layer value: 
SETS¢LVA.=LAYER 

>L6 

L5:GOOD¢MATT >; LCJ=LAYER 

SETS¢LV=LAYER 

a Put remaining rows in RMAT: 
L6: RMAT€ (~GOOD) ¢MAT 

MAT<GOOD/MAT 

Aa Consider only non-full sets in this layer 
Aa or empty sets in any layer: 

SETS¢ CARPS=0) VSETSAARPS#BLK 

>L8 
L7:SETS¢ARPS#BLK 

a Convert to indices: 
L8: SETS¢SETS/toeSETS 

NREC¢looMAT 


A No. records filed so far: 
FREC¢O 
a Branch if no slots available in existing sets: 


L9:>(BLK=MIN¢eL/IARPSCSETS])eL25 

A Branch if more than 1 set needed: 
T¢NREC-FREC 

>(T>BLK-MIN)oL10 
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V CATREC (continued) 
[71] A Choose fullest set which will hold all recs: 
[72] S€¢BLK-|ARPSCSETS ] 
£73] S€¢SETSCStL/(CS2T)/S]) 
C74] 2%L11 
[75] a Choose set with most empty slots: 
[76] L10:S¢SETS( CIARPSCSETS])UMIN] 
[77] aA No. records to be inserted within the set: 
(78] Li1l:M¢TLRPSCSJI-|IARPS[S] 
[79] A No. records to be catenated within the set: 
[80] N¢(T-M)LBLK-RPS[S]J 
[81] A Displacement (no. components) before this set: 
[82] SDISP¢DISP+INCRxXS+ "1 
[83] A Branch if no deletion field: 
C84] %CxDEL)IL12 
[85] A Read deletion field for set S: 
C86] BIT¢OFREAD DCMP¢+TIE,DEL+SDISP 
[87] a Branch if no records to be inserted: 
[C88] 2%CxM)1L12 
[89] A Indices of available insertion slots: 
[90] INDS¢Mo(~BIT)/tLeBIT 
[91] aA Columns of MAT filed so far: 
[92] L12:c¢0 
[93] a Next field index: 
C94] I¢l1 
[95] A Loop by active field: 
[96] L13:%CI>NFLD) eL19 
[971] A Field number: 
C98] F¢FLD(I] 
[99] A Field width (no. columns): 
C100] WeWID[IF] 
[101] A Read field F for set S: 
[102] DATACOUFREAD CMP¢TIE,F+SDISP 
C103] aA Branch if a matrix field: 
[104]  >(W>1)eL15 
C105] a Column of MAT if a vector field: 
C106] C€eCtl 
[107] a Branch if no records to insert: 
[108] >(xM)LL14 
[109] DATALCINDSJI¢€MAT[E FREC+iUM:C] 
[110] A Branch if no records to catenate: 
[111] >(xXN)1L18 
[112] L14: DATA¢DATA,MATE (FREC+M)+iN;C] 
C113] »L18 
[114] aA Branch if no records to insert: 
[115] L15:39CxM)1L16 
[116] DATALCINDS ; J¢MATC FREC+i1M3;C+ iW] 
C117] a Branch if no records to catenate: 
[118] %CxN)JL17 
[119] L16: DATA¢DATA, [1 ]MAT(E (FREC+M)+ iN;C+ iW) 
C120] L17:Ce¢c+w 
[121] L18:DATA OFREPLACE CMP 
C122] Tl¢el[+1 
C123] 713 
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V CATREC (continued) 
[124] a Branch if no deletion field: 
[125] L19: LEAD€lL 
[126] 2»CxDEL)JJL22 
[127] Aa Branch if no records to insert: 
[128] 2%CxM)JL20 
[129] a Turn active record bits on: 
[130] BITLINDSIJ<1 
C131] LEAD€A/BIT=A\BIT 
[132] a Branch if no records to catenate: 
L133] w2%CxN)JLlL21 
C134] L20:BIT¢+BIT,Nol 
C135] L21: BIT OFREPLACE DCMP 
[136] aA Increment FREC by no. records added to this set: 
C137] L22: FREC¢€FREC+M+N 
[138] RPSCSI¢€RPSCSI]+N 
[139] FPC9I€FPL9]+T<¢O=ARPS[S] 
[140] A Replace layer value if file layered and 
[141] aA set initially empty: 
[142] 2CTAXFPL1I)IL24 
[143] A Branch if a vector layer field: 
[144] 2%VECeL23 
[145] LV{(S;13<LAYER 
C146] »%L24 
[147] L23:LVCSJ¢LAYER 
C148] L24:ARPS(CSJ]¢€C° 1 1)01+LEAD]XM+N+iARPSCES] 
C149] FPC7IJ¢FPL7I+N 
C1501 a Exit if all of MAT filed: 
[151] »(NREC=FREC)oL34 
[152]  9L9 
C153] A Add new set; no. fields to be appended: 
C154] L25:NFLD¢FP(3] 
[155] A No. records to be appended in next set: 
[156] N¢BLKLNREC-FREC 
C157] A Columns of MAT filed so far: 


[158] C¢<O0 
[159] aA Next field number: 
[160] Fel 


[161] A Loop by field: 

[162] L26:9CF>NFLD)eL32 

[163] WeWLDLF] 

[164] aA Branch unless a latent field: 
[165] 2C(xW)eL27 

[166] DATA€10O 

[167] 9L31 

[168] A Branch if a matrix field: 

[169] L27:9(W>1)9L29 

[170] aA Branch unless it’s the deletion field: 
[171] »(DEL#F)eL28 

[172] DATANol 

[173] 9L30 

C174] A Column of MAT if a vector field: 
C175] L28s:CeC+l1 

[176] DATA¢MATLEFREC+LN;C] 
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C1] 
[2] 
[3] 
C4] 
C5] 
[6] 
eas 
[8] 
[9] 
L10] 


A Catenates elements/rows of F1,F2,F3,... 
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CATREC (continued) 


2L30 

L29: DATA¢MATE FREC+iUN3C+ iW] 

C¢eC+W 

A Branch unless set must be padded to BLK records: 


L30:9FILLIL31 

DATA ( BLK,1/oDATA) TDATA 
L31:T¢DATA OFAPPEND TIE 
FeF+i 

»L26 

A Increment FREC by no. 
L32: FREC¢FRECHN 
ARPS¢ARPS ,N 
RPS¢RPS , TEN! BLKXFILL 
FPL7J¢FPC7I1+T 
FPC6J¢FPL6]+1 
FPLOJ¢EFP[9I]+1 

A Catenate layer value if file layered: 
9(xXFPL1])1L33 

LV¢LV,C1LJLAYER 

a Continue unless all of MAT filed: 
L33:>(NREC#FREC) eL25 
L34:FPCL10J¢€FPL10]+NREC 

A Branch unless layered: 

»*(xXFPL11J)1L36 

A Branch if no data left to file: 
+(xXlopRMAT)LL35 

A Put remaining rows in MAT, continue: 
MAT¢RMAT 

9>L4 
L35:LV OFREPLACE TIE,9 
L36:RPS OFREPLACE TIE,8 

>*(xDEL) JL37 

ARPS OFREPLACE TIE,10 
L37:FP OFREPLACE TIE,7 

NFP¢FP 


records added to this set: 


CWSID: MULTIFLO] 
NFP¢FP CATRECWS NREC;ARPS;BIT; BLK: CMP; DATA; DCMP; DEL; 
DISP;F;FILED; FLD; FREC;FT;FILL;GOOD;1I;INCR; INDS; LAYER; 
LEAD; LV;M;MIN;N;NFLD;NR;RANK;RPS:S;:SDISP;SETS;SHAPE;T; 
TIE;VAR;VEC3;W;WID;01I0 
to file. 
OERROR(C(C1#oe9FP)V1ivV.#oNREC)/'RANK ERROR’ 
NREC¢loeNREC 
OERROR(C11#oFP)/'LENGTH ERROR’ 
NFP¢FP 
OL0¢1 
TIE¢FP([2] 
INCR¢FP[3] 
DISP¢FP[4] 
BLK¢FPL[5] 
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VY CATRECWS (continued) 
[11] DEL€!IFP(11] 
[12] FILL¢EFPL111]<0 
[13] FT¢QFREAD TIE,4 
[14] A Widths (no. cols.) of fields: 
C15] WID¢FT([1;] 
[16] A Indices of active fields: 
C17] FLD¢€CxXWID)/LtINCR 
[18] A Exclude deletion field: 
[19] NFLD¢o FLD¢ (FLD#DEL)/FLD 
[20] A Index of next active field: 
C21] I¢1 
[221 A Loop by active field to verify field vars 
[23] A (bypass this loop to make the fn faster and 
[24] aA to live dangerously): 
[25] L1:2CI>NFLD)pL2 
[26] A Field number: 
[27] FeFLD[IL] 
C28] aA Field width (no. columns): 
[29] WeWID[F] 
[30] A Look at field variable: 
[31] RANK¢eSHAPE¢pVAR¢2'F',tF 
[32] OMERRORCC24FTC23F1)40=1TOCVAR)/'DOMAIN ERROR’ 
[33] A Continue if singleton data: 
[34] l¢iI+1 
[35] 2(1A.=SHAPE)eL1 
[36] JUERROR(CRANK>2)/'RANK ERROR’ 
[37] DNERRORC ( CRANK=1)ACW=1TSHAPE) vV CW=1) ANREC=1TSHAPE) ¥ (CRANK 
=2)ACSHAPEA. =NREC,W)VSHAPEA.=1,W)/'LENGTH ERROR’ 
[38] Ll 
C39] A Exit if no records to catenate: 
[40] L2:°CxNREC)JO 
[41] ARPS©¢RPS¢OFREAD TIE,8 
[42] A Branch unless ARPS should be read: 
[43]  2C=/FPL7 101) eL3 
[44] ARPS¢€OFREAD TIE,10 
[45] a Branch unless layered file: 
[46] L3:39C(xXFPLLJI)JL7 
[47] A Read layer values: 
[48] LV¢OFREAD TIE,9 
[49] @ Flag records filed so far: 
[50] FILED¢NRECe0 
[51] A Look at layer field: 
[52] L4:VAR¢e?e’F! ,éFP[1] 
[53] WeWIDCFP[1]] 
[54] VAR¢CNREC, (W>1)eW) eVAR 
[55] a Branch if vector field: 
C56] VECeW=1 
[57] »VECeL5 
[58] a Flag records with layer of 1st unfiled rec: 
[59] LAYER¢VAREIFILEDtO; J 
[60] GOOD¢€VARA. =LAYER 
[61] A ...and sets with this record: 
[62] SETS¢LVA. =LAYER 
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VY CATRECWS (Ccontinued) 


C63] 
[64] 
[65] 
[66] 
[67] 
[68] 
[69] 
[70] 
C71] 
[72] 
[73] 
[74] 
C75] 
[76] 
C77] 
C78] 
C79] 
C80] 
C81] 
[82] 
C83] 
[84] 
C85] 
C86] 
C87] 
[88] 
[89] 
C90] 
C91] 
[92] 
[93] 
[94] 
[95] 
[96] 
[97] 
[98] 
[99] 
C100] 
C101] 
C102] 
[103] 
C104] 
C105] 
[i106] 
[107] 
C108] 
[109] 
C110] 
C111] 
[112] 
C113] 
[114] 


>L6 
L5: LAYER¢VAR[ FILEDtO J 
GOOD¢VAR=LAYER 
SETS¢LV=LAYER 
A Convert to indices: 
L6: GOOD¢GOOD/ te GOOD 
NR<eGOOD 
Aa Consider only non-full sets in this layer or 
A empty sets in any layer: 
SETS€(CARPS=0) VSETSAARPS#BLK 
>L8 
L7 : GOOD¢ UNR¢NREC 
SETS©¢ARPS# BLK 
A Convert to indices: 
L8:SETS¢SETS/teSETS 
A No. records filed so far: 
FREC¢0 
a Branch if no slots available in existing sets: 
L9:>(BLKs£MIN¢L/|IARPSCSETS])eL24 
A Branch if more than 1 set needed: 
T¢NR-FREC 
+(T>BLK-MIN) eL10 
A Choose fullest set which will hold all recs: 
S¢BLK-!|ARPS([SETS ] 
S¢+SETSCStL/(CS2T)/S] 
»L11 
A Choose set with most empty slots: 
L10:S¢SETS[E CIARPSCSETSIJ])tMIN] 
A No. records to be inserted within the set: 
L11:M¢TLRPSCS]-!ARPSCS] 
A No. records to be catenated within the set: 
N¢(T-M)LBLK-RPSC[S] 
A Displacement (no. components) before this set: 
SDISP©DISP+INCRXS+°1 
a Branch if no deletion field: 
»*(xDEL)JJL12 
A Read deletion field for set S: 
BIT¢OFREAD DCMP¢TIE, DEL+SDISP 
A Branch if no records to be inserted: 
9(xM)JJL12 
A Indices of available insertion slots: 
INDS€«Mo (~BIT)/teBIT 
A Next field index: 
L1i2:I€1 
A Loop by active field: 
L13:%C(I>NFLD) eL18 
A Field number: 
F¢FLDIT] 
A Field width (no. 
WeWIDEIF] 
A Look at field variable: 
VAR€O'F! , oF 


columns): 
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VY CATRECWS (continued) 
[115] A Reshape singletons, vectors and 1-row matrices 
C116] A to proper rank, shape: 
[1171] VAR¢CNREC, (W>1)0W) o0VAR 
[1181] A Read field F for set 8S: 
[119] DATA¢UFREAD CMP¢TIE,F+SDISP 
C120] A Branch if a matrix field: 
[121] > C(W>1)eL1i5 
[122] a Branch if no records to insert: 
C123] 9(xM)L1L14 
[124] DATALINDS J€VAR[LGOOD[ FREC+iLM] J 
C125] A Branch if no records to catenate: 
[126] 2>CxN)JL17 
[127] L114: DATA€¢DATA, VARC GOODE (FREC+M)+tNJ] 
[128] %L17 
[129] A Branch if no records to insert: 
C130] L15:390CxM)JL16 
£131] DATALINDS 3; 1}€ VARI GOOD[I FREC+tUM]I; ] 
[132] a Branch if no records to catenate: 
C133] 2%CxN)JLL17 
C134] L16: DATA¢DATA, LLJVARCTGOOD[ CFREC+M)+1iN] 3] 
[135] L17:DATA OFREPLACE CMP 
[136] T¢I+1 
£137] 9L13 
[138] A Branch if no deletion field: 
{139] L18: LEADe1 
[140] »*(xDEL)JL21 
[141] A Branch if no records to insert: 
[142] 2CxM)/JL19 
[143] a Turn active record bits on: 
[144] BITCINDSIJ€1 
C145] LEAD€A/BIT=A\BIT 
[146] A Branch if no records to catenate: 
[147] 2C€xN)JL20 
C148] L19:BIT¢BIT,Noel 
[149] L20: BIT OFREPLACE DCMP 
[150] A Increment FREC by no. records added to this set: 
C151] L21: FREC¢FREC+M+N 
[152] RPSCSI¢RPSCSI+N 
[153] FP(9IJ¢FP[I9]1+T€O=ARPSES] 
[154] a Replace layer value if file layered and 
[155] a set initially empty: 
[156] X9CTAXFPLITIILL23 
[157] a Branch if a vector layer field: 
[158]  %VECpL22 
[159] LVCS;]¢€LAYER 
[160] %L23 
[161] L22:LVCLS]1<LAYER 
[162] L23:ARPS[S]J€C 71 1)01+LEADI]XM+N+1ARPS[S] 
[163] FPLC7I¢FPL7I+N 
[164] A Exit if all of data in field vars. filed: 
[165] > (NR=FREC)oL33 
[166] ?>L9 
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CATRECWS (continued) 
A No. records to be appended in next set: 
L24:N€BLKLNR-FREC 
A Next field number: 
Eel 
A Loop by field: 
L25:3(F>INCR)oL31 
WewWID(F] 
a Branch unless a latent field: 
>(xW)oL26 
DATA©v0 
>L30 
A Branch unless it’s the deletion field: 
L26:>(DEL#ZF) 0 L27 
DATA¢No1 
>L29 
A Look at field variable: 
L27:VAR¢?'F' ,6F 
A Reshape singletons, vectors and l1-row matrices 
A to proper rank, shape: 
VAR¢ CNREC, (W>1)  0W) eVAR 
A Branch if a matrix field: 
>(W>1)oL28 
DATA<VAR[ GOOD[ FREC+1iN]J J 
%L29 
L28: DATA¢VAR[LGOOD[ FREC+1iNI] 3] 
A Branch unless set must be padded to BLK records: 
L29:9FILLIL30 
DATA¢CBLK,1/oDATA) TDATA 
L30:T¢DATA OFAPPEND TIE 
FeF+1 
9125 
A Increment FREC by no. 
L31: FREC¢FREC+N 
ARPS€ARPS ,N 
RPS€RPS , T¢Ni[ BLKXFILL 
FPCL7J¢FPL71+T 
FPL61¢FPC6]+1 
FPCL9OIJ¢FPL9I+1 
A Catenate layer value if file layered: 
9(xFPL1])1L32 
LV¢LV,{1]JLAYER 
A Continue unless all of data in field vars. 
L32:9(NR#FREC) 0 L24 
A Branch unless layered: 
L33:9CxFPL1iI)IIL34 
A Branch if no data left to file: 
FILED[CGOODI]¢€1 
9(CA/FILED)IL4 
LV OFREPLACE TIE,9 
L34:RPS OFREPLACE TIE,8 
»(xDELJILL35 
ARPS OFREPLACE TIE,10 
L35:FPCIOQOJ¢€FPL1OJ+NREC 


records added to this set: 


filed: 
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V 
[220] 
[221] 
[222] 
[223] 
[224] 
[225] 

Vv 


CATRECWS (continued) 

FP OFREPLACE TIE,7 

NFP¢FP 

A Erase global field variables: 
F¢sé(NFLD,1)oFLD 

Fe'F' ,C+/' '=F)OF 

FCUEX F 


CWSID: MULTIFLO] 


VY INDS¢FP IOTA VALS;ARPS;BIT;DATA; DEL;DISP; FLD; FOUND; FT; 


C1] 

[2] 

[3] 

[4] 

[5] 

C6] 

[7] 

C8] 

[9] 

[10] 
C11] 
[12] 
C13] 
C14] 
C15] 
[16] 
C17] 
[18] 
C19] 
[20] 
[21] 
[22] 
[23] 
C24] 
L250 
[26] 
[27] 
[28] 
[29] 
f30] 
[31] 
ie 
[33] 
[34] 
[35] 
[36] 
C37] 
[38] 
C39] 


INCR; IND; LV;NDEL;NSET$NUM;3RPS;S;SDISP3;SET3$SETS ;SHAPE;T 
sTIE;VIND;WID;O0I0 

Searches through field FP[12] for VALS and 
returns 2-row matrix of indices. First row 
is set number (Corigin 1); second row is index 
Corigin 1) within set. Result contains “1s 
where corresponding value is not found. 

Shape of result is 2,eVALS. Requires CMIOTA 
function if a character matrix field. 
OERROR(C1#oeeFP)/'RANK ERROR’ 

DERROR(C12#o0FP)/' LENGTH ERROR’ 

OLTO¢l1 

INCR¢FPCU3 ] 
DEL¢€|FP([11] 

FLD¢FP([12] 

DERROR( (FLDELINCRIJAFLD#DELI/' INVALID FIELD NUMBER’ 
TIE€FPL2 ] 
FT¢OFREAD TIE,4 

a Width (no. columns) of specified field: 
WID¢FT([1;FLD] 

OERRORCO=WID)/'INACTIVE FIELD’ 

NOERROR(C CWIDA1LIACXeepVALS ) AWID# 1TeVALS)/'LENGTH ERROR’ 
A Numeric field? 

NUM¢24FT02;FLD] 

HERRORCNUM40=1TOeVALS)/’'’DOMAIN ERROR’ 
a Branch unless VALS is a scalar for a mat fld: 
»( CWID=1) VXoeeVALS ) oe L1 
Aa Treat a scalar as a vector if a matrix field: 
VALS¢WIDeVALS 
A Determine shape of result; then 'ravel’ VALS: 
L1: SHAPE¢ (-WID#1)leVALS 
VALS<(CCX/SHAPE) , C(WID4#1) eWID) oeVALS 
a Initialize ‘'raveled’ result: 
INDS«(2,x/SHAPE)o~1 
A Exit if VALS is empty or if no records on file: 
>€COE€SHAPE) VO=FPL10J)0L14 

DISP¢FP[4] 

ARPS¢©RPS¢OFREAD TIE,8 
A Branch unless ARPS should be read: 

+(=/FPL7 10])oL2 

ARPS€OFREAD TIE,10 


D9 DDDDNDVD OVD 
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[40] 
C41] 
C42] 
[43] 
C44] 
[45] 
[46] 
[47] 
[48] 
[49] 
[50] 
C51) 
[52] 
[53] 
[54] 
[55] 
[56] 
C57] 
[58] 
[59] 
[60] 
C61] 
[62] 
C63] 
[64] 
[65] 
[66] 
[67] 
[68] 
[69] 
[70] 
C71] 
C72] 
C73] 
C74] 
[75] 
[76] 
C77) 
[78] 
[79] 
[80] 
[81] 
[82] 
C83] 
[84] 
[85] 
[86] 
C87] 
[88] 
C89] 
[90] 
[91] 


V IOTA (continued) 


a Consider only nonempty sets: 
L2:SETS¢O#ARPS 

A ...and sets specified in <layers>: 
>(xFPC11])1/L6 

»(xONC 'layers’)IL6 

A Read layer values: 

LV¢OFREAD TIE,9 

a Branch if matrix layers field: 
>C2=e0LV)oeL3 
SETS¢SETSALVEelayers 

>L6 


a Convert <layers> to matrix if not already: 


FILE DESIGN AND 


UTILITIES 


L3:QOERRORCFTC1;FPCL11J]# 1telayers)/’ LENGTH ERROR’ 


2(2=-pplayersj)eL4 

layers¢((x/ l1lolayers), l1toelayers)elayers 
A Branch if numeric matrix field: 
L4:2(O=1T0e LV) eL5 

SETS¢SETSAClayers CMIOTA LV)<lopeplayers 
+L6 

L5:SETS¢SETSAV/LVA. =8layers 

A Convert to indices; erase <layers>: 
L6:SETS¢SETS/tLeSETS 

NSET¢oSETS 

LV¢OEX ‘layers’ 


A Indices into INDS of values not yet found: 


VIND¢ilpeVALS 

A Last set index: 

S€0 

A Loop by nonempty set: 
L7:>(S2NSET) oL14 

A Current set index and number: 

S¢S+1 

SET¢SETS[S] 

A Displacement (no. components) before this 
SDISP¢DISP+INCRXSET+ 1 

A Read field FLD for set SET: 
DATA¢COFREAD TIE,FLD+SDISP 

A Branch if deletion field unneeded: 
> (NDEL¢ARPSCSET]=RPSCSETIJ) eL9 

A Branch if deletion field needed: 

> (NDEL¢CARPS[SET1>0)/1L8 

A Active records are leading records: 
DATA CARPS[CSET], CWID>1) oWID) eDATA 
>L9 


A Read and apply deletion field for set SET: 


L8: BIT¢OFREAD TIE,DEL+SDISP 
DATA©BITZ DATA 

A Branch if a matrix field: 
L9:>(WID>1)0L10 

A Search algorithm for vector field: 
IND¢DATALVALS 

9L12 
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V IOTA Ccontinued) 
[92] Aa Branch if a numeric matrix field: 
[93] L10:2NUMeL11 
[94] a Search algorithm for character matrix field: 
[95] IND¢DATA CMIOTA VALS 
C96] 2%L12 
[97] A Search algorithm for numeric matrix field: 
C98] L11: IND€1++/A\VALSV.#&DATA 
[99] a Determine those values found in this set: 
[100] L12:FOUND¢IND<1eeDATA 
[101] A Continue to next set if none found: 
C102] IND¢FOUND/IND 
[103] 2%C€xpIND)JL7 
[104] A Consider deletion field if applicable: 
[105] »NDELeL13 
[106] IND¢CBIT/iLepBITILCITND]I 
[1071 A Insert indices in result: 
[108] L13:INDSL ;FOUND/VINDI¢SET,CO.5JIND 
[109] A Compress indices of remaining values: 
C110) BIT<¢~FOUND 
C111] VIND¢<BIT/VIND 
[112] a Exit if all found: 
[113] %CxeVIND)IIL14 
[114] aA Else, compress remaining values: 
[115] VALS¢€BITZVALS 


C116] >L7 
C117] L14:INDS¢(C2,SHAPE)oINDS 
Vv 


CWSID: MULTIFLO] 
V INDS¢€IOTARHO FP;3;ARPS;BIT;CMPS;DEL;3;FT;1I;LV;NSET;RPS; 

SETS ;START;TIE;0I0 
[1] a Returns 2 row matrix of indices of all active records 
[2] aon file. First row is set number (origin 1); second 
[3] a row is index (origin 1) within set. 
C4] QOERROR(1#eeFP)/’'RANK ERROR’ 
C5] HERROR(C11#oeFP)/' LENGTH ERROR’ 
C6] OTO¢1 
[7] a Branch if some records of file: 
[8] %C(xFPL10])eLI1 
[9]  INDS€ 2 0 00 
[10] ~L8& 
[11] L1i:TIE¢FP{[2] 
[12] ARPS€RPS<HUFREAD TIE,8 
C13] A Branch unless ARPS should be read: 
C14) 2%C=/FPL7 10)J)eL2 
[15] ARPS¢UFREAD TIE,10 
[16] A Consider only nonempty sets: 
C17] L2:SETS¢0O#ARPS 
[18] A ...and sets specified in <layers>: 
[19] »€xFP[1])/L6 
C20] 2»CxONC ‘layers’ )/JL6 
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V IOTARHO (continued) 


[21] 
[22] 
C23] 
[24] 
[25] 
[26] 
[27] 
[28] 
[29] 
[30] 
C31] 
[32] 
[33 
[34] 
[35] 
[36] 
C37] 
[38] 
[39] 
C40] 
C41] 
[42] 
[43] 
[44] 
[45] 
[46] 
[47] 
C48] 
[49] 
[50] 
[51] 
[52] 
[53] 
[54] 
[55] 
[56] 
[57] 
[58] 
[59] 
C60] 
C61] 
C62] 
C63] 
[64] 
[65] 
[66] 
C67] 
[68] 
C69] 
C70] 


A Read layer values and fld types: 
LV¢OQFREAD TIE,9 
FT¢OFREAD TIE,4 

A Branch if matrix layers field: 
>2(2=eeLVIJeL3 
SETS¢SETSALVElayers 
>L6 

a Convert <layers> to matrix if not already: 

L3:QOERRORCFT£L1;FPL1J1]# 1tplayers)/'LENGTH ERROR’ 
»(2=epplayers)oeL4 
layers¢((x/"lloelayers), “1felayers)oelayers 

A Branch if numeric matrix field: 

L4:9(0=1T0eLV)eL5 
SETS¢SETSAClayers CMIOTA LV)<loeplayers 
»L6 

L5:SETS¢SETSAV/LVA. =Slayers 

A Convert to indices; erase <layers>: 

L6:SETS¢SETS/LpSETS 
LV¢OEX ‘layers’ 

A Construct 2 rows: 
INDS¢!|ARPS([SETS] 

A I¢MONIOTA INDS: 
I¢I+tpI¢INDS/- 110,+\-INDS 
INDS¢€CINDS/SETS),(0.51]1 

A Exit if no deleted records: 
> (CARPSCSETS]A.=RPSCSETS]) eL8 

A Deletion field number: 

DEL¢!1FPL11] 

A Set no.s for which deletion bits are to be read: 
SETS¢CARPS[SETS]<0)/SETS 
NSET+¢pSETS 

A Exit if none: 
>*(xNSETILL8 

A Component numbers of deletion bit fields: 
CMPS¢(DEL+FP[4])+FP(C3]xSETS+71 

A Starts of each selected set's indices in result: 
ARPS¢!1ARPS 
START¢(C0,+\ARPSIJCSETS] 

A Compress ARPS to selected sets: 

ARPS¢ARPS(SETS J 

A Next set index: 
T¢1 

A Loop by set; read deletion bits: 

L7:BIT¢OFREAD TIE,CMPS[T] 

A Insert correct indices in result: 
INDS[2;5;STARTCI1+LARPSCI]]¢BIT/tpBIT 

A Increment and repeat if more: 

[T¢i+l] 
+(I<NSET) oL7 

L8:I¢OEX '/layers’ 


set inds, rec inds: 
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[WSID: MULTIFLO] 

V AAINDS€AAEXPR SLASHIOTARHO AAFP; AAACTIVE; AAALL; AAARPS ; 

AABIT;AADATA; AADEL3 AADISP; AAF 3 AAFLD; AAFLDS 3; AAFNAM3 AAFT 
sAAINCR; AAIND; AALV3; AANDEL; AANFLD3 AANSET 5 AARPS $ AAS; 

AASDISP;3 AASEL; AASET 3; AASETS ; AATIE 3 AAWID; UIO 

Loops through active sets doing the following: 
reads the fields specified in 111FP (calling them 
F5, F9, etc. for fields 5, 9, etc.), executes the 
character vector EXPR, converts the resulting bit 
vector to indices and returns the indices of all 
records found for which EXPR returns a 1. Result 
is a 2 row matrix: first row is set number (origin 
1); second row is index (origin 1) within set. If 
EXPR and 11/FP are empty, all records are selected. 

[10] Note that EXPR is executed in origin 1; e.g. 

[11] 'F3032]’ always refers to 2nd column. 

[12] MERRORCCIZ9epAAFP)V1<eeAAEXPR)/'RANK ERROR’ 

[131 QOERRORCO=1TOPAAEXPR)/’DOMATN ERROR’ 

[14] OQO10¢1 

C15} AAALLCO=AANFLD¢oAAFLDS¢11LIAAFP 

[16] OMERROR( CAAALL=AAEXPRV.#!' 'J)V11>o0AAFP)/' LENGTH ERROR’ 

[17] AAINCREAAFPL3] 

[18] AADELCIAAFP[11] 

C19] OERRORC CA/AAFLDSELAAINCR) SAADELEAAFLDS)/'INVALID FIELD 

NUMBER’ 

[20] AATIE¢CAAFP[2] 

[21] AAFT*¢OFREAD AATIE,4 

[22] A Width (no. columns) of specified fields: 

[23] AAWID¢AAFT[ 13; AAFLDS] 

[24] OERRORCOE€AAWID)/'INACTIVE FIELD’ 

C25] a Initialize result as empty: 

[26] AAINDS€ 2 0 00 

[271 A Exit if no records on file: 

[28] »*CxXAAFP[10]) 10 

[29] Aa Field names (e.g. ‘'F5 F9') to be erased below: 

[30] AAFNAM¢S CAANFLD, 1) 0AAFLDS 

[31] AAFNAM¢’F’ ,(C+/' '=AAFNAM) OAAFNAM 

[32] AADISP¢AAFP[4] 

C33] AAARPS©AARPS¢COFREAD AATIE,8 

[34] A Branch unless ARPS should be read: 

C35] 2C=/AAFPL7 1OJ)JeAALI 

[36] AAARPS¢OFREAD AATIE,10 

(371 A Consider only nonempty sets: 

[38] AAL1: AASETS©€OZAAARPS 

[39] a ...and sets specified in <layers>: 

[40] 2>CXAAFPI1I)JLAAL5 

C41] 2»C(xONC ‘'layers’ )LAALS 

[42] A Read layer values: 

[43] AALV¢OFREAD AATIE,9 

[44] A Branch if matrix layers field: 

C45] »C2=epeAALV) PAAL2 

[46] AASETS©AASETSAAALVE layers 

C47] ~»AALS 


C1] 
C2] 
C3] 
C4] 
C5] 
L6] 
C7] 
C8] 
[9] 


DDDDIDIDIDDIDID DDO 
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V SLASHIOTARHO (continued) 
Aa Convert <layers> to matrix if not already: 
AAL2: UERROR(AAFTL1;AAFP[11]11#~l1tolayers)/’'LENGTH ERROR’ 


[48] 
[49] 


[50] 
[51] 
[52] 
C53] 
[54] 
[55] 
[56] 
C57] 
C58] 
[59] 
C60] 
[61] 
[62] 
[63] 
[64] 
C65] 
[66] 
C67] 
[68] 
[69] 
C70] 
C71] 
C72] 
C73] 
C74] 
[75] 
C76] 
C77] 
[78] 
[79 ] 
[80] 
C81] 
C82] 
C83] 
[84] 
C85] 
C86] 
[87] 
C88] 
[89] 
C90] 
C91] 
[92] 
C93] 


[94] 
[95] 
C96] 
[97] 
C98] 
[99] 


»*(2=opeplayers) pAAL3 
layers¢((x/"ilolayers), “1Telayers)oelayers 
a Branch if numeric matrix field: 
AAL3:>(O=1TOPAALV) PAAL4 


AASETS©AASETSAClayers CMIOTA AALV)<leplayers 


2AAL5 
AAL4 : AASETS€CAASETSAV/AALVA. =Qlayers 
A Convert to indices; erase <layers>: 
AALS: AASETS¢AASETS/ Up AASETS 
AANSET€eAASETS 
AALVCOEX ‘layers’ 
Aa Last set index: 
AAS€0 
A Loop by nonempty set: 
AAL6: 9 (AASZAANSET) 00 
a Current set index and number: 
AAS€AAS+1 
AASET¢AASETSLAAS ] 


A Displacement (no. components) before this set: 


AASDISP©CAADISP+AAINCRXAASET+ 71 

A Branch if deletion field unneeded: 
> CAANDEL©AAARPS[ AASET ]>0) pAAL7 

A Read deletion field for set SET: 
AABIT€OUFREAD AATIE, AADELt+AASDISP 

A Branch if no selection expression: 
AAL7: AASEL€1 

»*AAALLGAAL12 

A Are all records in this set active? 
AAACTIVE¢AAARPS[ AASET]=AARPSCAASETY] 
A Last field index: 

AAF€¢0O 

A Loop by field specified: 

AAL8: > CAAFZAANFLD) pAALI11 

Aa Current field index and number: 
AAFfAAF+1 

AAFLD¢AAFLDSL[AAF] 

A Read field FLD for set SET: 
AADATA¢COFREAD AATIE,AAFLD+AASDISP 

@ Branch if all records in this set active: 
+AAACTIVE@AAL10 

A Branch if deletion field needed: 

2 AANDELJAALY 
A Active records are leading records: 


AADATA©€ (AAARPS[AASET] , CAAWIDIAAFI>1) PAAWIDLIAAFI)0 


AADATA 

2AAL10 

A Apply deletion field: 

AALS: AADATACAABITAAADATA 

A Assign data to global variable Fn: 
AAL10: ¢'F’, (CEAAFLD) , '’©€AADATA’ 

9AAL8 
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VY SLASHIOTARHO (continued) 

A Once all fields have been read, 
AALI11: AASEL©fAAEXPR 

A Erase field variables (e.g. F5, F9,...): 
AAIND¢€OEX AAFNAM 

A Continue to next set if none found: 
AAL12: AAIND€AASEL/ 1! AAARPS[ AASET J] 
>(XepAAIND) LAALB 

a Consider deletion field if applicable: 
>AANDELeAAL13 

AAIND€ CAABIT/LECAABITILAAIND] 

A Catenate indices to result: 

AAL13: AAINDS€AAINDS , AASET,[O0.5]JAAIND 
2AAL6 


execute EXPR: 


[WSID: MULTIFLO] 
V NFP¢FP DELREC INDS;BIT;CMP;DEL;DISP3;F;FLD;FLDS;INCR; 
IND:;NEW:NFLD;NSET:OLD;RPS;3S;:SDISP:SET;SETS;TIE;UNQO;01I0 
A Deletes records identified by file indices matrix 
A INDS. First row is set number (origin 1); second 
A row 1S index (origin 1) within set. INDS may be 
a of any dimension as long as its first coordinate 
Aa is 2. 
OERROR(1#e09FP)/'RANK ERROR’ 
OFRROR((11#7eFP)V241ToINDS)/’'’ LENGTH ERROR’ 
A Exit if nothing to delete: 


C1] 
C2] 
[3] 
[4] 
[5] 
C6] 
[7] 
C8] 


[9] NFP€FP 

[10] 2» COE€eINDS)e0 

[11] SETS¢, 1 0 4INDS 

C12} INDS<, O 1 AINDS 

[13] O10¢1 

[14] TIE¢FPL[2] 

[15] INCR¢FP[3] 

[16] DISP¢FP[4] 

(17] DEL€!FPC11] 

[18] A RPS will be changed if DEL=0; ARPS will be changed 
[19] a if DEL>O. Read and replace only one or the other: 
[20] RPS¢UOFREAD TIE,8+2xxDEL 

[21] A Determine active fields if no deletion field: 
[22] %C(xDEL)JeL1 

[23] NFLD¢eFLDS¢C(xCOFREAD TIE,4)£13;1)/tINCR 

[24] A Determine distinct set numbers (deleting 1s): 
[25] L1:UNQ¢SETSL4SETS] 

[26] NSET¢eUNQ€ CUNO? 114°1,UNQ)/UNQ 

[271 aA Last set index: 

[28] S<€0O 

[29] a Loop by distinct set: 

[30] L2:%(S2=NSET)oL5 

[31] aA Current set index and number: 

[32] S¢S+l 

[33] SET<¢UNQLS] 
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V DELREC (continued) 
[34] A Displacement (no. components) before this set: 
[35] SDISP¢€DISP+INCRXSET+ 1 
[36] A Indices to delete in this set: 
C37] IND¢CSET=SETS)/INDS 
[38] A Branch unless deletion field exists: 
C39} ?%CxXDEL)JL3 
[40] A Read deletion field for set SET: 
[41] BIT¢OUFREAD CMP¢TIE, DEL+SDISP 
[42] a Turn off specified indices and replace: 
[43] BITCINDJ€0O 
[44] BIT OFREPLACE CMP 
C45] A Compute new no. records: 
[46] NEW¢+/BIiT 
[47] OLD¢€iRPSCSET] 
[48] A Reset parameters: 
C49] NFP£9IJ¢NFPL9I]-NEW=0 
[50] NFPL10J¢NFP[L101+NEW-OLD 
[51] RPSCSETI€¢NEWX(C 71 1)01+NEW=+/A\BITI 
[52] ?L2 
[53] A 
[54] a Turn off specified indices: 
[55] L3:OLD¢RPS([ SET] 
[56] BIT¢OLDe1 
[57] BITLCINDIJ€O 
C58] A Compute new no. records: 
[59] NEW¢+/BIT 
[60] A Reset parameters: 
[61] NFPLOIJ¢NFP[9]-NEW=0 
C62] NFPC7I¢NFPL10]¢NFP[7J]+NEW-OLD 
[63] RPSCSETIJ¢NEW 
[64] A Last field index: 
[65] F¢o0 
[66] A Loop by active field: 
C67] L4:%CF2NFLD)oeL2 
[68] A Current field index and number: 
C69] FePr+] 
[70] FLD¢FLDS(CF] 
C711 A Read, compress, replace field FLD for set SET: 
C72] CMP¢TIE,FLD+SDISP 
[73] (CBIT/OFREAD CMP)OFREPLACE CMP 
[74] ~»L4 
[75] A File either RPS or ARPS: 
[76] L5:RPS OFREPLACE TIE,8+2xxDEL 
[77] NFP OFREPLACE TIE,7 

V 
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C1] 
[2] 
[3] 
C4] 
C5] 
C6] 
[7] 
[8] 
[9] 
[10] 
C11] 
C12] 
C13] 
[14] 
[15] 
[16] 
C17] 
C18] 
C19] 


C20] 
C21] 
[22] 
C23] 
[24] 
[25] 
[26] 
C27] 


[28] A Field names (e.g. 


[29] 
C30] 
C31] 
[32] 
[33] 
[34] 
[35] 
[36] 
C37] 
C38] 
[39] 
[40] 
[41] 
[42] 
[43] 
[44] 
[45] 
[46] 
[47] 
[48] 
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FILE DESIGN AND UTILITIES 


[WSID: MULTIFLO] 

AANFP¢€AAEXPR COMPRESS AAFP; AAAFLDS ; AAALL; AAARPS$ AABIT; 
AACMP ; AADATA3 AADEL; AADISP3$ AAF3; AAFLD; AAFLDS 3; AAFNAM 3; AAFT 
sAAINCR; AALV3; AANAFLD;3 AANDEL; AANEW; AANFLD;$ AANSET; AAOLD; 
AARPS $ AAS ; AASDISP3 AASEL; AASET; AASETS § AATIE3 AAWID; 0I0 
Loops through active sets doing the following: 

reads the fields specified in 11/FP (calling 

them F5, F9, etc. for fields 5, 9, etc.), 

executes the character vector EXPR, and deletes 

the records of that set which correspond to Os 

in the resulting bit vector. If EXPR and 11/FP 

are empty, all records are selected (none are 
deleted). Note that EXPR is executed in origin 

1; e.g. 'F3[32]'’ always refers to 2nd column. 

OERROR( (1# 9p AAFP) V1<p9AAEXPR)/'RANK ERROR’ 
ONERRORCO=1TOPAAEXPR)/'DOMAIN ERROR’ 

OT0¢1 

AAALL¢O=AANFLD¢ op AAFLDS©LILAAFP 

OERROR( CAAALL=AAEXPRV.#!' 'JVI1>o9AAFP)/' LENGTH ERROR’ 


A Exit if all records selected: 


A Width Cno. 


XAAALLOO 

AAINCREAAFPLE3 ] 

AADELE€!|AAFP[11] 

QNERROR( CA/AAFLDSELAAINCR) SAADELEAAFLDS)/'INVALID FIELD 
NUMBER’ 

AATIE€CAAFPE2 ] 

AAFT€0O FREAD AATIE,4 

columns) of fields: 

AAWIDCAAFT(E13 ] 

ONERROR(COQO€AAWIDIAAFLDS1)/'INACTIVE FIELD’ 


A Exit if no records on file: 


A 


A 


AANFP¢€11pAAFP 
>(XAAFPL10])10 
'F5 F9') to be erased below: 
AAFNAM¢6 CAANFLD, 1) 0 AAFLDS 
AAFNAME'F' ,C+/' '=AAFNAM) ®AAFNAM 
AADISP€AAFPLE4] 
AAARPS©€AARPS¢OFREAD AATIE,8 
Branch unless ARPS should be read: 
>C=/AAFPE7 10])eAAL1 
AAARPS¢COFREAD AATIE,10 
Determine active fields if no deletion field: 


AAL1:>(XAADEL) pAAL2 


A 


A 


A 


A 


A 


AANAFLD©¢p AAAFLDS€CXAAWID) /LAAINCR 
Consider only nonempty sets: 
AL2: AASETS€O0#AAARPS 

-.-and sets specified in <layers>: 
>CXAAFPL1J) LAAL6 
»(xONC ‘layers’ JIAAL6 

Read layer values: 
AALV<OFREAD AATIE,9 

Branch if matrix layers field: 
*(C2=eE9AALV) PAAL3 
AASETS¢AASETSAAALVE layers 
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V COMPRESS (continued) 


[49]  2AALE 

[50] a Convert <layers> to matrix if not already: 

[51] AAL3:QOERRORCAAFT[1;AAFP[L1]]# 1Tfelayers)/'LENGTH ERROR’ 

[52] »€2=pplayers) eAAL4 

[53] layers¢((x/"llelayers), 1fpelayers)olayers 

[541 aA Branch if numeric matrix field: 

[55] AAL4:3(0=1TOCAALV) PAALS 

[56] AASETS€AASETSAClayers CMIOTA AALV)<leelayers 

[57] ?>AAL6 

[58] AALS: AASETS¢AASETSAV/AALVA. =Qlayers 

[59] a Convert to indices; erase <layers>: 

[60] AAL6: AASETS€CAASETS/ tS AASETS 

[61] AANSET¢eAASETS 

[62] AALV¢OEX ‘layers’ 

[63] A Last set index: 

C64] AAS¢€O 

[65] A Loop by nonempty set: 

[66] AAL7:>CAASZAANSET) pAAL17 

[67] A Current set index and number: 

[68] AAS€AAS+1 

[69] AASET€¢AASETSL AAS] 

[70] A Displacement (no. components) before this set: 

[71] AASDISP¢€AADISP+AAINCRXAASET+° 1 

[72] A Branch if deletion field unneeded: 

[73] 7» CAANDEL©CAAARPSE AASETI>0) PAAL8 

[741 A Read deletion field for set SET: 

[75] AABIT¢OFREAD AATIE, AADEL+AASDISP 

[76] A Are all records in this set active: 

C77] AAL8: AAALL€AAARPS[ AASET]=AARPS[E AASET] 

[78] A Last field index: 

[79] AAF€0O 

[80] A Loop by field specified: 

[81] AAL9:>CAAFZAANFLD) PAAL12 

[82] A Current field index and number: 

C83] AAFfAAF+H+1 

[84] AAFLD€AAFLDS[AAF] 

[85] A Read field FLD for set SET: 

[86] AADATA€¢O FREAD AATIE,AAFLD+tAASDISP 

[87] A Branch if all records in this set active: 

[88] ~»AAALLEAALI11 

[89] A Branch if deletion field needed: 

[90] XAANDELIJAAL10O 

[91] A Active records are leading records: 

[92] AADATA©€CAAARPS[ AASET], CAAWIDELAAFLDI>1) pPAAWIDIAAFLD] Jo 
AADATA 

C93] »AALI11 

[94] A Apply deletion field: 

[95] AAL10: AADATA¢CAABITZAADATA 

[96] A Assign data to global variable Fn: 

[97] AAL11:¢'F’ ,CEAAFLD),'¢AADATA’ 

[98]  >AALY 

[99] A Once all fields have been read, execute EXPR: 


C100] AAL12: AASEL¢2AAEXPR 
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COMPRESS (continued) 
A Erase field variables (e.g. F5, F9,...): 
AAOLDCUEX AAFNAM 
Aa Continue to next set if all records selected: 
+(A/AASEL) pAAL7 
a Branch unless deletion field exists: 
»>(xXAADEL)JAAL15 
Aa Branch if deletion field not read in: 
>AANDELeAAL13 
A Reset BIT: 
AABIT¢AABIT\AASEL 
2AAL14 
AAL13: AABITCAARPS[ AASETITAASEL 
A Replace deletion field: 
AAL14: AABIT OFREPLACE AATIE,AADEL+AASDISP 
A Compute new no. records: 
AANEW¢+/AABIT 
AAOLD¢€ | AAARPS[ AASET ] 
A Reset parameters: 
AANFPL[9 J€AANFPL9 ]-AANEW=0 
AANFP([10]¢AANFP[10]+AANEW-AAOLD 
AAARPS[ AASETI©AANEWXC 71 1ICTL+AANEW=+/A\ AABIT]I 
9AAL7 
A Compute new no. records: 
AAL15: AANEW¢+/AASEL 
AAOLD¢AARPS[ AASET ] 
A Reset parameters: 
AANFPL9IJ©AANFP[I9J]-AANEW=0 
AANFPL[7 J€AANFP[E 10 ]<AANFPE 7 ]+AANEW-AAOLD 
AARPS[ AASET]I¢AANEW 
a Last field index: 
AAF€O 
A Loop by active field: 
AAL16:>( AAFZ2AANAFLD) 9pAAL7 
A Current field index and number: 
AAFCAAFtH1L 
AAFLDCAAAFLDS[ AAF J 
A Read, compress, replace field FLD for set SET: 
AACMPCAATIE, AAFLD+AASDISP 
CAASELAOFREAD AACMP)JUFREPLACE AACMP 
2AAL16 
AAL17: AANFP OFREPLACE AATIE,7 
A RPS has been changed if DEL=0; ARPS has been changed 
A if DEL>O. Replace only one or the other. 
»(XAADEL) pAAL18 
AARPS OFREPLACE AATIE,8 
>0 
AAL18:AAARPS ODFREPLACE AATIE,10 
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FILE DESIGN AND UTILITIES 


C[WSID: MULTIFLO] 


V MAT¢INDS INDEX FP;CHR;COL;COLS;DATA;DEL;DISP;F;FLD; 


FLDS;FT;INCR;IND;NFLD;NREC;NSET3S;SDISP3;SET;SETS ;SHAPE 
sTIE;UNQ;W;WID;UIO0 

Returns data from fields 11/FP for records 
identified by file indices matrix INDS. 
First row of INDS is set number (origin 1); 
second row is index (origin 1) within set. 
INDS may be of any dimension as long as its 
first coordinate is 2. Result has same number 
of columns as fields 11/FP have columns. 
Leading shape of result is 1J/oINDS. 
DERROR(1#peFP)/'RANK ERROR’ 
OERRORCC€11>oeFP)V241TeINDS)/' LENGTH ERROR’ 
OHITO0¢+1 
NFLD¢o FLDS¢11/FP 

INCR¢FPLE3 ] 

DEL¢€!1FPL11] 

DERROR( CA/FLDSELINCR) =DELEFLDS)/'INVALID FIELD NUMBER’ 
TIECFP[2] 

FT¢COFREAD TIE,4)03FLDS] 
A Width (no. columns) of specified fields: 
WID¢FT[13 ] 

DERRORCOE€WID)I/' INACTIVE FIELD’ 
a Fields must be all character or all numeric: 
CHR¢e2=1TFTL2;] 

HERROR(CCHRYV.#2=FT£23;])/'DOMAIN ERROR’ 

A Shape of result (excluding columns): 
SHAPE¢11/oINDS 

COLS¢€+/WID 

A Break apart indices: 

NREC¢opSETS¢€, 1 0 ZINDS 

INDS¢, O 1 #4INDS 

A Construct all-zero or all-blank result: 
»CHRoeL1 

MAT+(NREC,COLS) 00 

»>L2 
L1:MAT¢CNREC,COLS)o0’ ' 

RA Exit if no indices or no fields: 
L2:>(XNRECXCOLS )1L6 

DISP¢FP[4] 

A Determine distinct set numbers (deleting “1s): 
UNQ¢SETSL[ASETS] 

NSET¢oUNQ¢ (CUNQ# “11 °1,UNQ) /UNQ 
Aa Last set index: 

S¢0 
A Loop by distinct set: 
L3:>(S2NSET) 0 L6 
a Current set index and number: 

S¢S+1 

SET¢UNQOC[S] 
A Displacement (no. components) before this set: 
SDISP©+DISP+INCRXSET+ 1 


DDdDDIDIDIO OO 
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V INDEX (continued) 
[50] A Indices of INDS to retrieve in this set: 
LSJ IND¢(SET=SETS)/UNREC 
[52] a Last field index, and columns inserted so far: 
[53] F¢COLe0 
[54] A Loop by specified field: 
[55] L4:9(F2NFLD)0L3 
[56] aA Current field index, number and width: 
C57] FeF+l 
[58] FLD¢€FLDSCEF] 
[59] WeWID[F] 
[60] A Read data: 
[61] DATA¢COFREAD TIE,FLD+SDISP 
[62] A Branch if matrix field: 
[63] 2(W>1)eL5 
[64] A Insert vector of data: 
[65] COL€COL+1 
[66] MATLIND;COLIJ€DATALINDSLIND] ] 
[67]  >L4 
[68] A Insert matrix of data: 
[69] L5:MATLIND;COL+iLW]©¢DATALINDSECIND]I; J 
[70] COL€COL+W 
[71] 2%L4 
[72] A Exit if result has correct shape already: 
[73] L6:9(1=pSHAPE) 90 
[74] MAT¢(SHAPE,COLS) oMAT 
V 


C[WSID: MULTIFLO] 
VY INDS INDEXWS FP;DATA;DEL;DISP;F;FLD;FLDS;FT;INCR;IND; 
LAB;NFLD;NRECSNSETSS;SDISP;3;SET3SETS;SHAPE;T;TIE;UNO;W; 
WID;0I0 
Retrieves data from fields 11/FP for records 
identified by file indices matrix INDS. 
First row of INDS is set number (origin 1); 
second row is index (origin 1) within set. 
INDS may be of any dimension as long as its 
first coordinate is 2. The retrieved data are 
assigned to global variables named Fn where n 
is the number of the field retrieved (e.g. F3 
and F7 for 11/FP of 3 7). Global variables 
C10] have same number of columns as corresponding 
[11] fields. Leading shape is l1/oINDS. 
[12] QOERROR(C1#eeFP)/'RANK ERROR’ 
[13] OERRORCC11>eFP)V241ToINDS)/' LENGTH ERROR’ 
[14] O10¢1 
[15] NFLD¢pFLDS¢111FP 
[16] INCR¢FP[E3] 
[17] DEL€!IFP(11) 
[18] ONERROR(CA/FLDSELINCR) =DELEFLDS)/' INVALID FIELD NUMBER’ 
C19] TIE¢FPC2] 
C20] FT¢COFREAD TIE,4)[0;FLDS] 


Ci] 
[2] 
C3] 
C4] 
[5] 
C6] 
C7] 
[8] 
C9] 


DDDDIDDIDIDODDYD 
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V INDEXWS (continued) 


[21] 
[22] 
[23] 
[24] 
[25] 
[26] 
C27) 
[28] 
[29] 
[30] 
C31] 
[32] 
C33] 
[34] 
C35] 
C36] 
C37] 
C38] 
[39] 
C40] 
C41] 
C42] 
C43] 
[44] 
[45] 
C46] 
C47] 
C48] 
[49] 
C50] 
[51] 
[52] 
C53] 
[54] 
[55] 
[56] 
[57] 
[58] 
[59] 
[60] 
C61] 
[62] 
[63] 
[64] 
[65] 
[66] 
C67) 
[68] 
[69] 
C70] 
C71] 
C72] 
C73] 


A Width (no. columns) of specified fields: 
WID¢FTE1; ] 

QOBRRORCOEWID)/'’ INACTIVE FIELD’ 
A Shape of result (excluding columns): 
SHAPE¢1loINDS 

Aa Break apart indices: 

NREC¢oSETS¢€, 1 0 #INDS 

INDS¢<, O 1 #4INDS 

A Construct all-zero or all-blank globals. 
A Label vector needed below based upon field 
A rank and type (nvec, nmat, cvec, cmat): 
LABe(L2,L3,L4,L5)00C2LWID)+2x2=FTL23;]] 

A Last field index: 

FeO 

A Loop by specified field: 
L1:2(F2NFLD) oL7 

A Current field index and number: 

FeF+1 

FLD¢FLDS(EF] 

A Branch based upon type and width: 
>*LAB(CF] 

L2: DATA¢NRECp 0 

»>L6 

L3 : DATA¢(NREC,WID[F1])00 

>L6 

L4:DATA€NRECo' ' 

>L6 

L5:DATA©CNREC,WIDEFI])o’ ' 
L6:¢'F’,CéFLD),’€DATA’ 

%L1 

A Exit if no indices or no fields: 

L7:> CXNRECXNFLD)JJL11 

DISP¢FP[4] 


A Determine distinct set numbers (deleting “1s): 


UNQO*CSETSCASETS] 
NSET¢oUNQ¢ CUNQ# 11°71,UNQ)/UNQ 
A Last set index: 
S¢0 
A Loop by distinct set: 
L8:~(S2NSET)oeL11 
A Current set index and number: 
S€¢S+1 
SET¢UNQO[S ] 
A Displacement (no. 
SDISP¢DISP+INCRXSET+ 1 
A Indices of INDS to retrieve in this set: 
IND¢(SET=SETS)/UNREC 
A Last field index: 
FeO 
A Loop by specified field: 
L9:+(F2NFLD) oL8 
A Current field index, number and width: 
FeF+1 
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components) before this set: 


C74] 
[75] 
C76] 
[77] 
C78] 
C79] 
C80] 
C81] 
C82] 
C83] 
[84] 
[85] 
C86] 
C87] 
C88] 
[89] 
[90] 
[91] 
[92] 
[93] 
[94] 
C95] 
[96] 
C97] 
[98] 
C99] 


C1] 
C2] 
C3] 
C4] 
[5] 
[6] 
C7] 
C8] 
[9] 
[10] 
C1il] 
C12] 
bid 
C14] 
C15] 
[16] 
C17] 
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VY INDEXWS (continued) 
FLD¢FLDSCF] 
WeWIDEF] 
A Read data: 
DATA¢COU FREAD TIE,FLD+SDISP 
a Branch if matrix field: 
>(CW>1)eL10 
A Insert vector of data: 
®'F’ ,CéFLD),'CINDI<DATALINDSC IND] ]’ 
»>L9 
a Insert matrix of data: 
L10:¢’F’,CéFLD),'LIND; J©DATALINDSLIND];1]’ 
>L9 
a Exit if globals have correct shape already: 
L11:?C1=eSHAPE)e0 
A Last field index: 
F¢O 
Aa Loop by specified field: 
L12:>CF2NFLD)00 
A Current field index and number: 
FeF+] 
FLD¢FLDS[ FJ 
A Reshape globals Fn to conform with shape of indices: 
DATAc@#T¢’ F’ ,éFLD 
DATA¢ (SHAPE, 1/oDATA) pDATA 
oT, '¢DATA’ 
9L12 
V 
CWSID: MULTIFLO] 

V AAMAT¢AAEXPR SELECT AAFP; AAACTIVE; AAALL$; AAARPS 3 AABIT3 
AACHR 3; AACOL; AACOLS $ AADATA 3; AADEL; AADISP$ AAF; AAFILL; 
AAFLD; AAFLDS 3; AAFNAM; AAFT 3; AAINCR; AAIND; AAINDS 3; AALAB; 
AALV; AANDEL; AANFLD; AANSET 3 AANSFLD 3 AANUM 3 AAROWS 3; AARPS ; 
AAS 3; AASDISP3$ AASEL; AASET 5 AASETS 3 AASFLDS $ AAT; AATIE3; AAW; 
AAWID;0I0 

Aa Loops through active sets doing the following: 

A reads the fields specified in (11+(11/FP)10)/FP 

A Ccalling them F5, F9, etc. for fields 5, 9, etc.), 
A executes the character vector EXPR, and returns 

A data from fields (71+(11/FP)t0)T111IFP for the 

A records of that set which correspond to 1s in the 
A resulting bit vector. Result has one row per 

A record found and same number of columns as the 

A latter set of fields has columns. If EXPR and 

A selection fields are empty, all records are 

A selected. Note that EXPR is executed in origin 1; 
A e.g. 'F3032]' always refers to 2nd column. 

ODERROR( (1#e9p AAFP) V1<oeAAEXPR)/’RANK ERROR’ 
OERRORCO=1TOPAAEXPR)/'DOMAIN ERROR’ 

HLO¢#+l1 

Aa Extract 2 sets of fields from FP: 

AATC1L1L+FECLILVAAFP) 10 
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VY SELECT (continued) 


AANFLD¢pAAFLDS€111 CAAT+ 1) 0 AAFP 

AAALL€0=AANSFLD€ eAASFLDSCAATI AAFP 

OERROR( CAAALL=AAEXPRV.#' ')V11>pAAFP)/'LENGTH ERROR’ 
AAINCREAAFPL3 ] 

AADEL¢!1AAFP[11] 

AATCAASFLDS , AAFLDS 

OERROR( CA/AATELAATNCR) £<AADELEAAT)/'’ INVALID FIELD 
NUMBER’ 

AATIECAAFPL[ 2] 

AAFT¢UFREAD AATIE,4 

a Width (no. columns) of fields: 

AAWID¢CAAFT( 13 ] 

OERRORCOEAAWIDLAATI)/' INACTIVE FIELD’ 

a Fields must be all character or all numeric: 
AAT€2=AAFT[23AAFLDS ] 
AACHREITAAT 
OERRORCAACHRYV. #AAT)/'’ DOMAIN 

a Columns in result: 
AACOLS€+/AAWIDLAAFLDS] 

A Construct empty result: 
>AACHRpAALI 
AAMAT©(0,AACOLS ) pPAAFILLEO 
PAAL2 

AAL1: AAMAT¢€(C0,AACOLS)JPAAFILLE! ' 

A Exit if no records: 

AAL2Z:>9CXAAFPL10])J10 

A Field names (e.g. 'F5 F9’') to be erased below: 
AAFNAM¢S (CAANSFLD,1) 0AASFLDS 
AAFNAME¢'F!’',C+/' '=AAFNAM) OAAFNAM 
AADISP¢AAFPL 4] 

A Label vector needed below based upon field rank 

A and whether in WS or onfile when needed 

A (vfile,mfile,vws,mws): 

AALABe CAAL17 , AAL18 ,AAL19,AAL21)0 C2LAAWIDEAAFLDS ] )+2x 
AAFLDS€EAASFLDS ] 
AAARPS©CAARPSCOUFREAD AATIE,8 

A Branch unless ARPS should be read: 
4(=/AAFPL7 101) pAAL3 
AAARPS¢OFREAD AATIE,10 

Aa Consider only nonempty sets: 

AAL3 : AASETS€¢0#AAARPS 

A ...and sets specified in <layers>: 
>(XAAFP[1]) LAAL7 
*9(xONC ‘/'layers’ )LAAL7 

A Read layer values: 

AALV€OFREAD AATIE,9 

A Branch if matrix layers field: 
>(C2=ppAALV) PAAL4 
AASETS¢CAASETSAAALVE layers 
>AAL7 

A Convert <layers> to matrix if not already: 

AAL4: DERRORCAAFT[E1;AAFPL1]1]# 1Telayers) /’' LENGTH 
>(2=polayers) pAALS 


ERROR’ 


ERROR’ 
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V SELECT (continued) 


[69 ] 
C70] 
C71] 
[72] 
C73] 
[74] 
C75] 
C76] 
[77] 
[78] 
[79] 
C80] 
C81] 
[82] 
[83] 
[84] 
[85] 
[86] 
[87] 
[88] 
[89] 
[90] 
[91] 
[92] 
[93] 
[94] 
[95] 
[96] 
[97] 
[98] 
[99] 
[100] 
C101] 
C102] 
[103] 
C104] 
[105] 
C106] 
[107] 
C108] 
C109] 
[110] 
C111] 


C112] 
C113] 
[114] 
C115] 
C116] 
C117] 
[118] 
C119] 


layers¢((x/ llolayers), “1Tplayers)olayers 
a Branch if numeric matrix field: 
AAL5:>(O=1TOPAALV) pAALE 
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AASETS©€AASETSAClayers CMIOTA AALV)<lpelayers 


PAAL7 
AAL6: AASETS¢AASETSAV/AALVA. =Slayers 
A Convert to indices; erase <layers>: 
AAL7: AASETSCAASETS/1lU eAASETS 
AANSET¢pAASETS 
AALV¢UEX ‘layers’ 
a Last set index: 
AAS€0 
A Loop by nonempty set: 
AAL8:>CAASZAANSET) 00 
Aa Current set index and number: 
AAS€AAS+1 
AASETCAASETSLAAS] 
Aa Displacement (no. components) before this 
AASDISP€AADISP+AAINCRXAASETH 1 
a Branch if deletion field unneeded: 
> ( AANDELCAAARPS[ AASET]>0) 9 AALI 
A Read deletion field for set SET: 
AABIT¢0OFREAD AATIE,AADELtAASDISP 
Aa Branch if no selection expression: 
AALS: AASELE1 
SAAALLPAAL14 
A Are all records in this set active: 
AAACTIVECAAARPS [AASET1]=AARPSCAASET] 
a Last field index: 
AAF€O 
A Loop by field specified: 
AAL10: 39 (CAAFZAANSFLD) eAAL13 
Aa Current field index and number: 
AAFCAAFHI 
AAFLD¢AASFLDSLAAF] 
Aa Read field FLD for set SET: 
AADATACOUFREAD AATIE, AAFLD+AASDISP 


a Branch if all records in this set active: 


+AAACTIVEPAAL12 

a Branch if deletion field needed: 
XAANDELJVAALI1 

A Active records are leading records: 


AADATA€ CAAARPS[ AASET] , CAAWIDLAAFLD]>1) eAAWID[IAAFLDI)oe 


AADATA 

9AALI12 

A Apply deletion field: 

AAL11: AADATACAABITAAADATA 

A Assign data to global variable Fn: 
AAL12:¢%'F’ ,CEAAFLD),'’©€AADATA’ 
®AAL1LO 

a Once all fields have been read, 
AAL13: AASEL€¢®2AAEXPR 
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4 
C120] 
C121] 
C122] 
C123] 
C124] 
C125] 
[126] 
C127] 
C128] 
£129] 
C1i30] 
C131] 
C132] 
[133] 
C134] 
C135] 
C136] 
[137] 
[138] 
C139] 
[140] 
C141] 
[142] 
[143] 
[144] 
[145] 
[146] 
C147] 
C148] 
C149] 
C150] 
[151] 
C152] 
[153] 
[154] 
C155] 
C156] 
C157] 
[158] 
C159] 

V 


SELECT (continued) 
A Indices to retrieve (after squeezing deletions): 
AAL14: AAINDCAASEL/ 1 | AAARPSLAASET ] 
a Continue to next set if no records selected: 
+ (XAANUM€p AAIND) LAAL23 
A No. records found before this set: 
AAROWS¢10 9 AAMAT 
A Expand result for new records: 
AAMATCAAMAT, [1] CAANUM, AACOLS ) pAAFILL 
Aa Indices to retrieve (before squeezing deletions): 
AAINDS¢AAIND 
a Branch if deletion field unneeded: 
AANDELPAAL15 
A Reset INDS, considering deletion field: 
AAINDS€CAABIT/LPAABIT) CAAIND] 
a Last field index and columns inserted so far: 
AAL15: AAFEAACOLEO 
A Loop by field to retrieve: 
AAL16: > (CAAFZAANFLD) PAAL2Z3 
Aa Current field index, number and width: 
AAFCAAF+1 
AAFLDCAAFLDSLAAF] 
AAWCAAWIDEIAAFLD] 
A Branch depending on field width and whether 
A already in workspace: 
2AALABL[AAF] 
AAL17: AADATA¢CCOFREAD AATIE, AAFLD+AASDISP) CAAINDS] 
»AAL20 
AAL18: AADATACCOFREAD AATIE, AAFLD+AASDISP)CAAINDS; J 
®AAL22 
AAL19: AADATA€(C@'F! ,SAAFLD) CAAIND] 
AAL20: AACOLEAACOL+AL 
AAMAT[ AAROWS+LAANUM 3 AACOLI€¢AADATA 
XAAL16 
AAL21: AADATA¢(2'F! ,FSAAFLD) CAAIND3 ] 
AAL22: AAMAT[AAROWS+1L AANUM $3 AACOL+LAAW]©€AADATA 
AACOL¢ AACOL+AAW 
~AAL16 
A Erase field variables (e.g. F5, F9,...): 
AAL23: AATCOUEX AAFNAM 
>AAL8 
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C1] 
C2] 
bog 
[4] 
[5] 
C6] 
C7] 
C8] 
[9] 
[10] 
C11] 
[12] 
C13] 
[14] 
C15] 
C16] 
[17] 
C18] 
[19] 
C20] 
[21] 
[22] 
C23] 
[24] 
C25] 
[26] 
[27] 


[28] 
[29] 
C30] 
[31] 
C32] 
C33] 
[34] 
C35] 
[36] 
C37] 
[38] 
[39] 
[40] 
C41] 
[42] 
C43] 
[44] 
[45] 
[46] 
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>DDDDIDIIDIDIDVDDDIDVD DO 


A 


A 


A 


A 


A 


A 


A 
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([WSID: MULTIFLO] 
AAEXPR SELECTWS AAFP; AAACTIVE; AAALL; AAARPS 5; AABIT; AACOL 

s AACOLS 3; AADATA; AADEL; AADISP3$ AAF3 AAFLD; AAFLDS 3; AAFNAM;3 
AAFT; AAINCR; AAIND; AAINDS 3; AALAB; AALAB2 3 AALV; AAM1 3; AAM2 ; 
AAM3 5; AAM4 3; AANDEL;$ AANFLD; AANSET ; AANSFLD; AANUM; AAROWS ; 
AARPS 3; AAS ; AASDISP; AASEL; AASET 5 AASETS 5 AASFLDS $ AASTART 5 
AAT; AATIES$ AATYPES 3; AAW; AAWID;0I0 

Loops through active sets doing the following: 

reads the fields specified in (11+(11/FP)t10)/JFP 

(calling them F5, F9, etc. for fields 5, 9, etc.), 

executes the character vector EXPR, and retrieves 

data from fields (°1+(11/FP)LO)T11/FP for the 
records of that set which correspond to 1s in the 
resulting bit vector. The retrieved data are 
asSigned to global variables named Fn where n is 
the number of the field retrieved (e.g. F3 and F7 
if 3 7 are the numbers of the latter set of 
fields). Global variables have same number of 
columns as corresponding fields. If EXPR and 
selection fields are empty, all records are 

selected. Note that EXPR is executed in origin 1; 

e.g. 'F30;2]’ always refers to 2nd column. 

OERROR( (1#99AAFP) V1<opAAEXPR)/'’RANK ERROR’ 
DERRORCO=1TOGPAAEXPR)/'DOMAIN ERROR’ 
OLO¢1 

Extract 2 sets of fields from FP: 
AAT€11+(C11LAAFP)10 
AANFLD¢ ep AAFLDS€11J1 (AAT+~1) 0 AAFP 
AAALLEO=AANSFLD€eAASFLDSCAATLAAFP 
QERROR( CAAALL=AAEXPRV.#' 'JV11>eAAFP)/'LENGTH ERROR’ 
AAINCREAAFPL3 J 
AADEL¢!1AAFP[11] 

AAT¢CAASFLDS , AAFLDS 

DERROR( CA/AATELAAINCR) SAADELEAAT)/' INVALID FIELD 
NUMBER’ 

AATIE¢CAAFPL2 ] 

AAFT¢COFREAD AATIE,4 

Width (no. columns) of fields: 

AAWID¢AAFTLE13 ] 

OERRORCOEAAWIDLAATI)/'INACTIVE FIELD’ 

Exit if no fields: 
> CXAANFLD) LAAL36 

Datatypes (1,2,3,4) of fields to be retrieved: 
AATYPES¢AAFTL2 5; AAFLDS ] 

No. preceding cols. for each fld within its datatype: 
AAT€(1L4)°. =AATYPES 
AASTART€+#AATX( OAAT) TO, AASC+\ AATX ( PAAT) CAAWIDLAAFLDS ] 

No. columns of each datatype: 

AACOLS¢€, 4 ~1 TAAS 

Construct empty result matrices (Cone per datatype): 
AAM1€(0,AACOLS[11)e0 
AAM2€¢(CO,AACOLS[T2I])0' ' 

AAM3€(0,AACOLS[ 31) 00 
AAM4¢(0,AACOLS[41)e0 
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V SELECTWS (continued) 


C47] 
C48] 
[49] 
[50] 
[51] 
[52] 
[53] 
[54] 
[55] 
[56] 


[57] 
[58] 
[59] 
[60] 
C61] 
[62] 
[63] 
C64] 
[65] 
[66] 
[67] 
C68] 
C69] 
C70] 
C71] 
C72] 
C73] 
C74] 
C75] 
C76] 
[77] 
[78] 
[79] 
C80] 
C81] 
[82] 
[83] 
C84] 
[85] 
[86] 
C87] 
[88] 
[89] 
[90] 
[91] 
[92] 
[93] 
[94] 
[95] 
C96] 
C97] 


A Exit if no records: 
>(xXAAFPL101)1J19 

Aa Field names (e.g. 'F5 F9’') to be erased below: 
AAFNAM¢6 (AANSFLD,1) eAASFLDS 
AAFNAME'F’ ,C+/' '=AAFNAM) PAAFNAM 
AADISP€AAFP[4] 

a Label vector needed below based upon field rank 

A and whether in WS or onfile when needed 

Aa (vfile,mfile,vws,mws): 

AALAB€ CAAL15,AAL16,AAL17 ,AAL19) [ C2LAAWIDLAAFLDS1)+2~x 
AAFLDSEAASFLDS ] 

A Label vector based upon datatype: 
AALAB2€CAAL22 ,AAL23 ,AAL24,AAL25) CLAATYPES ] 
AAARPS€AARPS¢CUFREAD AATIE,8 

A Branch unless ARPS should be read: 
>(=/AASFPL7 101) eAAL1 
AAARPS©CHFREAD AATIE,10 

A Consider only nonempty sets: 

AAL1: AASETS€0# AAARPS 

A ...and sets specified in <layers>: 
>(XAAFP[1])1LAAL5 
29(xONC ‘layers’ )JAAL5 

A Read layer values: 

AALV¢OUFREAD AATIE,9 

A Branch if matrix layers field: 
+(2=0pAALV) pAAL2 
AASETS¢AASETSAAALVElayers 
9AALS5 

A Convert <layers> to matrix if not already: 

AAL2: DERRORCAAFTEL1;AAFPL1J]# 1ftolayers)/'LENGTH ERROR’ 
*(2=oolayers) eAAL3 
layers¢((x/"llolayers), 1lfelayersjoelayers 

A Branch if numeric matrix field: 

AAL3:>(O=1TOPAALV) PAAL4 
AASETS¢AASETSAClayers CMIOTA AALV)<loeplayers 
AALS 

AAL4: AASETS©AASETSAV/AALVA. =Qlayers 

Aa Convert to indices; erase <layers>: 

AALS: AASETS¢AASETS/ US AASETS 
AANSET¢pAASETS 
AALV¢COEX ‘layers’ 

A Last set index: 

AAS€O 

A Loop by nonempty set: 

AAL6:>( AASZAANSET) pAAL27 

Aa Current set index and number: 
AAS€AAS+1 
AASETCAASETSLAAS] 

A Displacement (no. components) before this 
AASDISP€AADISP+AAINCRXAASET+ 71 

A Branch if deletion field unneeded: 
> (C AANDEL©AAARPS [ AASET]>0) pAAL7 


set: 


-420- 


Chapter 14 Solutions 


FILE DESIGN AND UTILITIES 


V SELECTWS Ccontinued) 
[98] A Read deletion field for set SET: 


[99] 

[100] 
[101] 
[102] 
[103] 
C104] 
[105] 
[106] 
[107] 
[108] 
[109] 
[110] 
[111] 
C112] 
[113] 
C114] 
£115] 
C116] 
C117] 
118] 
[119] 


C120] 
C121] 
C122] 
C123] 
C124] 
[125] 
C126] 
C127] 
[128] 
C129] 
[130] 
C131] 
C132] 
C133] 
C134] 
[135] 
C136] 
[137] 
C138] 
C139] 
[140] 
[141] 
[142] 
[143] 
[144] 
[145] 
C146] 
[147] 
C148] 


AABIT¢CUFREAD AATIE, AADEL+AASDISP 

Aa Branch if no selection expression: 

AAL7: AASEL¢1 

2 AAALLEAAL12 

a Are all records in this set active: 
AAACTIVECAAARPS[ AASET]=AARPS[ AASET] 

Aa Last field index: 

AAF€0O 

A Loop by field specified: 
AAL8:2(AAFZAANSFLD) eAALI1 

a Current field index and number: 

AAFCAAFH+1 

AAFLD¢AASFLDSLAAF] 

a Read field FLD for set SET: 

AADATACU FREAD AATIE, AAFLD+AASDISP 

A Branch if all records in this set active: 
SAAACTIVEPAAL10 

a Branch if deletion field needed: 

XAANDELJAALY 

a Active records are leading records: 

AADATA€ CAAARPSLAASET] , CAAWID[L AAFLD1>1)  eAAWIDLAAFLD] )o 
AADATA 

%AAL10 

A Apply deletion field: 

AALY: AADATACAABITZAADATA 

A Assign data to global variable Fn: 

AALI10: @'F’ ,CEAAFLD) ,'’€AADATA’ 
2AAL8 

A Once all fields have been read, 
AAL11: AASEL¢E®AAEKPR 

Aa Indices to retrieve Cafter squeezing deletions): 
AAL12: AAIND€AASEL/ 1 | AAARPS[ AASET] 

A Continue to next set if no records selected: 
> (XAANUM¢p AAIND) LAAL26 

A No. records found before this set: 
AAROWS¢10epAAM1 

A Expand result matrices for new records: 
AAM1©AAM1,011  CAANUM, AACOLS[ 11) 90 
AAM2¢AAM2,CIICAANUM,AACOLS[2])o0' ' 
AAM3€AAM3 , C1] CAANUM, AACOLS[ 31) 90 

AAM4©€AAM4 ,£1](AANUM, AACOLS[41])090 

Aa Indices to retrieve (before squeezing deletions): 
AAINDS©€AAIND 

A Branch if deletion field unneeded: 
>AANDELeGAAL13 

A Reset INDS, considering deletion field: 
AAINDS€ CAABIT/LPAABIT) CLAAIND] 

A Last field index: 

AAL13: AAF€¢O 

A Loop by field to retrieve: 

AAL14: > CAAFZAANFLD) PAAL2Z6 


execute EXPR: 


~421- 


Chapter 14 Solutions 


FILE DESIGN AND UTILITIES 


SELECTWS (continued) 

A Current field index, number and width: 
AAFCAAF+1 

AAFLDCAAFLDSEAAF] 

AAWCAAWIDLAAFLD] 

A Branch depending on field width and whether 

a already in workspace: 

>AALAB[AAF] 

AAL15: AADATA€COFREAD AATIE,AAFLD+AASDISP) CAAINDS ] 
2AAL18 

AAL16: AADATA¢COFREAD AATIE, AAFLD+AASDISP) LAAINDS; ] 
2AAL2Z0 

AAL17: AADATA€ (@'F' ,SAAFLD)LCAAIND] 

AAL18 : AACOLCAASTARTLAAFJ+1 

X~AAL2Z1 

AAL19: AADATA€(e'’F!’ ,SAAFLD) CAAIND; ] 

AAL20: AACOLCAASTARTE AAFJ]+LAAW 

A Branch based upon datatype: 

AAL21:2~AALAB2LAAF] 

AAL22: AAMI1TAAROWS+LAANUM $3 AACOLJI©¢AADATA 

2AALI4 

AAL23 3 AAM2 [AAROWS+LAANUM 3 AACOLJ¢AADATA 

XAALI14 

AAL24: AAM3 [AAROWS+LAANUM 3; AACOLI©AADATA 

®AAL14 
AAL25: AAM4[ AAROWS+LAANUM; AACOLI]©€AADATA 

%~AAL14 

A Erase field variables (e.g. F5, F9,...): 
4AL2Z26: AATCOUEX AAFNAM 

>AALE 

A Label vector by datatype needed below: 
AAL27: AALABCCAAL31, AAL32,AAL33 ,AAL34) LAATYPES ] 

Aa Last field index: 

AAF¢0 

A Loop by field to retrieve: 
AAL28: > (CAAFZAANFLD) 00 
Aa Current field index, 
AAFCAAF+1 
AAFLD¢AAFLDSLAAF] 
AAWCAAWIDEAAFLD] 

A Branch if matrix field: 

+( AAW>1) pAAL29 
AACOL¢CAASTARTL AAFJ+1 

2AAL30 

AAL29: AACOLCAASTART[ AAF I+ lL AAW 
A Branch based upon datatype: 
AAL30: 9AALABL[ AAF] 

AAL31: AADATACAAMLE 5; AACOL] 
»AAL35 

AAL32: AADATACAAM2[ 3 AACOL] 
®AAL35 

AAL33: AADATACAAM3E 3; AACOL] 
>AAL35 

AAL34: AADATACAAM4T 5 AACOL] 


number and width: 
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V SELECTWS (continued) 
[202] aA Assign to Fn vars.: 
[203] AAL35:@'F' ,CEAAFLD),'’©AADATA’ 
[204] »AAL28 
[205] AAL36: AAS¢0EX ‘layers’ 
V 


C[WSID: MULTIFLO] 
V R¢eA ASSIGN B 
C1) aA Used as: rinds INDEXA (FP,flds) ASSIGN mat 
C2] R¢A 
[3] assign¢B 
V 


CWSID: MULTIFLO] 

V NFP¢INDS INDEXA FP;A;AFLD;ARPS3;BAD;BIT;BLK;CMP;COL; 

COLS ;DATA;DCMP;DEL;DISP;F;FILED;FILL;FLD;FLDS;FREC;FT; 

GOOD; I1;ITINDS;INCR; IND; LAY; LAYER; LEAD; LSETS ; LV; LVAR3;M;3 

MINsNsNF3SNFLD3SNR3;NREC$NSET$RPS3S3SD;sSDISP3;SET;SETS;3 
SHAPE;SIND;SINGLE;T;TIE;UNQ3;VAR;VEC3sW;WID;U0I0 

Used as: FP¢rinds INDEXA (FP,flds) ASSIGN mat 
Inserts data from global matrix <assign> into 
fields 11/FP for records identified by file 

indices matrix INDS. First row of INDS is set 
number (origin 1): second row is index (origin 1) 
within set. INDS may be of any dimension as long as 
its first coordinate is 2. <assign> has same number 
of columns as fields 11/FP have columns. 

[9] Leading shape of <assign> is 1lloINDS. <assign> is 

C10] erased upon completion. 

[11] OMERROR(C1#eeFP)/'RANK ERROR’ 

[12] OERRORCC11>eFP)V241ToeINDS)/'LENGTH ERROR’ 

C13] O10O¢1 

C14] NFLD¢eFLDS¢€111FP 

[15] NFP¢lloFP 

[16] INCR¢FP(3] 

C17] DEL€!IFPL11] 

C18] FILL¢CFPL[11)]<0 

[19] OERRORCCA/FLDSELINCR) =DELEFLDS)/’'INVALID FIELD NUMBER’ 

C20] TIE¢FP2] 

[21] FT¢OFREAD TIE,4 

[22] A Width (no. columns) of specified fields: 

[23] WID¢FT(1;] 

[24] COLS¢+/W<eWIDL[FLDS ] 

[25] QMERRORCOEW)/' INACTIVE FIELD’ 

[26] a Fields must be all character or all numeric: 

[27] S€¢2=1TT¢FTL[2;FLDS] 

[28] OMERRORCSV.#2=T)/'DOMAIN ERROR' 

[29] A Break apart indices: 

[30] SHAPE¢1/oINDS 

[31] NREC¢eSETS¢, 1 0 4INDS 

[32] INDS¢, 0 1 ZINDS 


C1] 
C2] 
C3] 
C4] 
C5] 
C6] 
C7] 
C8] 


DDDDIDIDIDIO 
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VY INDEXA (Ccontinued) 


[33] 
[34] 
[35] 
[36] 


A Singleton data? 
SINGLE¢14.=oassign 
+SINGLEeL3 
Aa Branch unless singleton (vector) record: 


[37] 2%C(CCOLS#1)A1A.="lloassign)/L1 

[38] OERROR(COLS# 1teassign)/'LENGTH ERROR’ 

[39] »L2 

[40] L1:0GERROR( (oeSHAPE)A.#(opassign)+ = 1, (COLS=1)e0)/' RANK 


ERROR’ 


[41] S€¢(C1+eSHAPE)e(passign),1 

[42] JDERROR(SV.#SHAPE,COLS)/’ LENGTH ERROR’ 

[43] »(2=o9passign)oeL3 

[44] L2:assign¢(NREC,COLS) passign 

[45] aA Exit if no records or no fields: 

[46] L3:%(xNRECXNFLD)/JL43 

C47] LAY¢FP[1] 

[48] DISP¢«FP([4] 

[49] BLK¢FP(5] 

[50] aA Read LV, RPS, ARPS if layer fld being assigned: 
[51] 7» CLAYE€FLDS)IL4 

[52] LV¢OFREAD TIE,9 

[53] RPS©ARPS¢OFREAD TIE,8 

[54]  3(=/FPL7 101)eL4 

[55] ARPS¢OFREAD TIE,10 

[56] aA Determine distinct set numbers (deleting “l1s): 
[57] L4:UNQ<SETSCASETS] 

[58] NSET¢eUNQ¢ CUNQ# “1!1°1,UNQ) /UNQ 

[59] A Last set index: 

[C60] SIND¢O 

[61] A Loop by distinct set: 

C62] L5:%(SIND2NSET) 0 L43 

[63] A Current set index and number: 

C64] SIND¢SIND+1 

C65] SET<CUNQCSIND] 

[66] A Displacement (no. components) before this set: 


C67] SDISP€¢DISP+INCRXSET+ 1 

[68] a Indices and elts of INDS to retrieve in this set: 
[69] IND¢ (CSET=SETS)/tNREC 

[70] I€INDSCIND]I 

[71] A Elts of INDS of recs whose layer value changes: 
C72] BAD€10 

[73] A Last field index, and columns filed so far: 
[74] F¢coL¢o 

[75] A Loop by specified field: 

[76] L6:%CF2NFLD)eL13 

(77] A Current field index, number and width: 

[78] FeF+t+l] 

[79] FLD¢FLDS(EF] 

[C80] a Is this the layer field being changed? 

[81] LAYER¢LAY=FLD 

C82] WewWIDCFLD] 

{831 A Read data: 


[84] 


DATACO FREAD CMP¢TIE, FLD+SDISP 
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VY INDEXA (continued) 
e Branch if matrix field: 
>(W>1)eL9 
Aa Insert vector of data. 
Aa Branch if singleton: 


[85] 
[86] 
[87] 
C88] 


C89] 
[90] 
C91] 
[92] 


XSINGLEoeL7 

COL¢COL+1 
DATALIJ¢assignlLIND;CoL] 
>L8 


[93] 
[94] 
[95] 
[96] 


L7: DATALL]¢assign 

A Branch unless layer field: 
L8:>LAYERIL12 

a Flag recs whose layer val has changed: 


[97] 

[98] 

[99] 

[100] 
C101] 
[102] 
C103] 
[104] 
[105] 
C106] 
C107] 
[108] 
C109] 
C110] 
C111] 
C112] 
C113] 
C114] 
[115] 
[116] 
C117] 
C118] 
[119] 
f120] 
C121] 
C122] 
C123] 
[£124] 
C125] 
[126] 
[127] 
C128] 
[129] 
C130] 
C131] 
[132] 
C133] 
C134] 
[135] 
[136] 
C137] 


BAD¢CLVUESET]#DATACII])/1I 
2L12 


A Insert matrix of data. 


Aa Branch if singleton: 

L9:>2SINGLEeL10 
DATA[LI;]¢assign(IND;COL+liW] 
COL¢€COL+W 
>L1il 

L10:DATA[ I; ]¢assign 

A Branch unless layer field: 

L111: »*LAYERIL12 

A Flag recs whose layer val has changed: 
BAD¢€CDATA[LI;]V.4LVISET; 1) /I1 

A Replace data on file: 

L1i2:DATA OFREPLACE CMP 
»L6 

A Next set if layer values unchanged: 

L13:39(xXpBAD)JIL5 

A Indices of active fields: 
AFLD¢CxWID)/LINCR 

A Exclude deletion field: 

NF¢oAFLD¢ (AFLD#DEL) /AFLD 

A Flag records filed so far: 
FILED¢( eBAD) 00 

A Look at layer field: 
LVAR¢OQOFREAD TIE, LAY+SDISP 

A Branch if vector field: 
VEC¢WID(I LAY] =1 

L14:2VECeL15 

A Flag records with layer of 1st unfiled rec: 
LAYER¢LVARL BADE FILED1iO];] 
GOOD¢LVAR[I BAD; ]A.=LAYER 

A ...and sets with this record: 
LSETS¢LVA.=LAYER 
»L16 

L15: LAYER¢LVAR[ BAD[ FILEDi0] J 
GOOD¢LVAR[ BAD] =LAYER 
LSETS¢LV=LAYER 

A Update FILED: convert to indices: 

L16: FILED¢FILEDVGOOD 
GOOD¢GOOD/ BAD 
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INDEXA (continued) 
NR¢pGOOD 
Aa Consider only non-full sets in this layer or 
Aa empty sets in any layer: 
LSETS¢€ CARPS=0) VLSETSAARPS# BLK 
A Convert to indices: 
LSETS¢LSETS/teLSETS 
Aa No. records filed so far: 
FREC€O 
A Branch if no slots available in existing sets: 
L17:>(BLK=MIN¢eL /A©lARPSCLSETS1)0L32 
A Branch if more than 1 set needed: 
T¢NR-FREC 
+(T>BLK-MIN) oL18 
A Choose fullest set which will hold all recs: 
S¢BLK-A 
S€¢LSETSCStL/(S2T)/S] 
2119 
A Choose set with most empty slots: 
L18:S¢LSETS(LAUMIN J 
A No. records to be inserted within the set: 
L19:M¢TLRPS£SI-!1ARPSLS] 
A No. records to be catenated within the set: 
N¢€CT-M)LBLK-RPS[S] 
A Displacement (no. 
SD¢DISP+INCRXS+ 1 
A Branch if no deletion field: 
»>(xXDEL)JJL20 
A Read deletion field for set S: 
BIT¢OFREAD DCMP¢TIE, DEL+SD 
A Branch if no records to be inserted: 
>(xM)1L20 
A Indices of available insertion slots: 
LINDS¢Me(~BIT)/toeBIT 
A Next field index: 
L20: F€1 
A Loop by active field: 
L21:>(F>NF)eL26 
A Field number: 
FLD¢AFLD[IF] 
A Field width Cno. 
WeWIDLFLD]) 
A Look at original set: 
VARCOFREAD TIE,FLD+SDISP 
A Read field F for set S: 
DATACUFREAD CMP¢TIE,FLD+SD 
A Branch if a matrix field: 
>(CW>1)eL23 
A Branch if no records to insert: 
9(xM)/L22 
DATAL[L TINDS J]€VAR[ GOOD[ FREC+1iMJ] 
Aa Branch if no records to catenate: 
9CxXN)JLL25 
L22: DATA©DATA, VARI GOODE (FREC+M)+1iN]] 


components) before this set: 


columns): 
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V INDEXA (continued) 
C1911] »L25 
[192] A Branch if no records to insert: 
[193] L23:9C(xM)LL24 
[194] DATALIINDS; ]1€VAR(I GOODE FREC+iM]:] 
[195] A Branch if no records to catenate: 
[196] ~»CxXN)LL25 
[197] L24:DATA¢DATA, [LJ VAR[CGOOD[ (FREC+M)+iNJ3 1] 
[198] L25:DATA OFREPLACE CMP 
[199] FeF+l1 
[200] 2%L21 
[201] a Branch if no deletion field: 
[202] L26: LEAD¢el1 
[203] %CxDELIJJL29 
[204] A Branch if no records to insert: 
[205] %*CxM)1lL27 
[206] aA Turn active record bits on: 
[207] BITCIINDS J€1 
[208] LEAD¢A/BIT=A\BIT 
[209] a Branch if no records to catenate: 
[210] 9(xN)JL28 
[211] L27:BIT¢+BIT,Nol 
C212] L28: BIT OFREPLACE DCMP 
[213] A Increment FREC by no. records added to this set: 
[214] L29: FREC¢FREC+M+N 
(215] RPSLSJ€¢RPSCSI+N 
[216] NFPL9I¢NFPL9I1+T€¢O0=ARPS(IS] 
[217] A Replace layer value if set initially empty: 
C218] r%TLlL31 
[219] a Branch if a vector layer field: 
[220] »VECeL30 
C2211 LV{S;]J¢LAYER 
[222] %L31 
[223] L30: LVLSJ]<¢LAYER 
[224] L31:ARPS(S]€¢C°1 1)01+LEADI]XM+N+IARPSCS] 
[225] NFPL7IJ¢NFPL7I+N 
[226] A Exit if all of data in field vars. filed: 
[227] > (NR=FREC) oe L40 
[228] 2»L17 
[229] A No. records to be appended in next set: 
[230] L32:N¢€BLKLNR-FREC 
[231] aA Next field number: 
C232] Fel 
[233] A Loop by field: 
[234] L33:2(F>INCR) eL39 
[235] WewIDC(F] 
[236] a Branch unless a latent field: 
[237]  »(xW)eL34 
[2381 DATA€LO 
[239]  %L38 
[240] A Branch unless it’s the deletion field: 
[241] L34:>%(DEL#F)eL35 
[242] DATA+Nol 
[2431] 7L37 


a2) 


Chapter 14 Solutions 


[265] 
[266] 
[267] 
C268] 
[269] 
C270] 
(271] 
C272] 
C273] 
[274] 
[275] 
[276] 
C277] 
[278] 
[279] 
C280] 
C281] 
C282] 
[283] 
[284] 
[285] 
[286] 
[287] 
[288] 
[289] 
[290] 
[291] 
C292] 
[293] 
C294] 
[295] 


INDEXA (continued) 

A Look at original set: 

L35:VAR¢COUFREAD TIE,F+SDISP 

Aa Branch if a matrix field: 

>(W>1)0L36 

DATA¢VAR[ GOODL[ FREC+iNJ ] 

9L37 

L36: DATA¢VAR[ GOODE FRECH+1iN] 3] 

A Branch unless set must be padded to BLK records: 
L37:°9FILLIL38 

DATA*¢ (BLK,1/oDATA) TDATA 
L38:T¢DATA UFAPPEND TIE 
FeF+i] 

9133 

A Increment FREC by no. 
L39: FREC¢€FRECt+N 
ARPS¢ARPS ,N 
RPS¢RPS , TENI BLKXFILL 
NFPC7I¢E¢NFP(71+T 
NFPL6]¢NFPL[6]+1 
NFPLOIJENFP(9)]+1 

A Catenate layer value: 
LV¢LV, (1LJLAYER 

A Continue unless all of data in field vars. 
> (NR#FREC) 0 L32 

A Branch if no data left to file: 
L40:9(CA/FILEDJIL14 

A 

Aa Branch unless deletion field exists: 
>*(xDEL)JIL41 

A Read deletion field for set SET: 
BIT¢OFREAD CMP¢TIE,DEL+SDISP 

A Turn off specified indices and replace: 
BITCBADJ]€0 

BIT OFREPLACE CMP 
A Compute new no. 
N¢+/BIT 
M¢IRPSCSET J 

A Reset parameters: 
NFPL9J¢NFP[9]J-N=0 
ARPS[CSETI€NxXC71 1)01+N=+/A\BITI 
9L5 

A 

Aa Turn off specified indices: 
L41:M¢RPS([SET] 
BIT¢Mel 

BITC BADI]¢<0 

A Compute new no. 
N¢+/BIT 

A Reset parameters: 
NFPLOI¢NFPL9OI-N=0 
NFPC7IJ¢NFPC71+N-M 
RPS(CSETJ€N 


records added to this set: 


filed: 


records: 


records: 
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V INDEXA (continued) 
[296] A Last field index: 


[297] 
[298] 
[299] 
[300] 
[301] 
[302] 
C303] 
[304] 
[305] 
[306] 
[307] 
[308] 
[309] 
C310] 
{311] 
[312] 
C313] 
[314] 
C315] 
{316] 

V 


C1] 
[2] 
C3] 
C4] 
[5] 
[6] 
C7] 
C8] 
C9] 
[10] 
C11] 
C12] 
C13] 
C14] 
LL. J 
C16] 
C17] 
C18] 
[19] 
[20] 
C21] 
[22] 
[23] 


DDDDOBIIDIDDODODV 


A 


F¢O 
A Loop by active field: 
L42:9(F2NF)0L5 
Aa Current field index and number: 

FeFr+] 

FLD¢AFLD[F] 
A Read, compress, replace field FLD for set SET: 
CMP¢TIE,FLD+SDISP 

(BIT/OFREAD CMP)OFREPLACE CMP 

>L42 
A 
A Erase <assign> and exit: 
L43:F¢OEX ‘assign’ 

a Replace LV, RPS, ARPS if layer field assigned: 
»>(LAYE€FLDS)LO 

LV OFREPLACE TIE,9 

NFP OFREPLACE TIE,7 

RPS OFREPLACE TIE,8 

>(xDEL)LO 

ARPS OFREPLACE TIE,10 


CWSID: MULTIFLO] 

NFP¢INDS INDEXWSA FP3A;AFLD;ARPS;BAD;BIT;BLK;CMP;3 DATA; 
DCMP;DEL;DISP;F;FILED; FILL; FLD;FLDS; FREC;FT;GOOD;1; 
IINDS;INCR; IND; LAY; LAYER; LEAD; LSETS;LV;LVARSM;MIN;N;NF 
sNFLD;NR3NREC$SNSET;SRPS3S;3;SD;SDISP3;SET;SETS ;SHAPE;SIND; 
T;5TIE;U0NQ;VAR;VECSW;WID;010 

Inserts data from global field variables (e.g. F3, 
F5,... for fields 3, 5,...) into fields 11/FP for 
records identified by file indices matrix INDS. 

First row of INDS is set number (origin 1); 

second row is index (origin 1) within set. 

INDS may be of any dimension as long as its 

first coordinate is 2. Global field variables 

have same number of columns as corresponding 

fields. Leading shape is 1loeINDS. Global field 
variables are erased upon completion. 
OERROR(C1#eeFP)/'RANK ERROR’ 
OERROR((11>oFP)V2#1ToINDS)/'’LENGTH ERROR’ 

Exit if no fields: 
NFLD¢pFLDS€111FP 
NFP¢1loFP 
*(XNFLD)LO 

O10<«1 

INCR¢FP[3] 
DEL¢€/]FP£11] 
FILL¢FPL111)<0 
OERROR(CA/FLDSELINCR) =DELEFLDS)/'INVALID FIELD NUMBER’ 
TIE¢FP[2] 
FT¢OFREAD TIE,4 
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VY INDEXWSA Ccontinued) 


a Width (no. columns) of specified fields: 
WID¢FTL[ 13 ] 

OERRORCOEWIDEFLDSJ)/'INACTIVE FIELD’ 

Aa Shape of field variables (excluding columns): 
SHAPE¢11oINDS 

a Last field index: 

F¢O 

A Loop by specified field to verify field vars 
a (bypass this loop to make the function faster 
A and to live dangerously): 
L1:2(F2NFLD)oL3 

a Current field index, 
FeFti 

FLD¢FLDS[F] 

W¢w1ID( FLD] 

A Look at field variable: 

S¢oDATA¢e'F’,5FLD 
QOEFRROR(CC24FT0C23;FLD1]1)40=1TOpDATA)/'DOMAIN ERROR’ 
A Done if singleton data: 
9(1A.=S)eL1 

A Branch unless ‘singleton’ 
®9CCWALIAIA.="115S)1L2 
QOERRORCW#71TS)/' LENGTH ERROR’ 

»*L1 

L2: OERROR( (eSHAPE)A.#(0S)+71, (CW=1)00)/'RANK ERROR’ 
N€(1+oSHAPE)0eS,1 

OHERROR(NV.#SHAPE,W)/'LENGTH ERROR’ 

>L1 

A Break apart indices: 

L3:NREC¢pSETS¢, 1 0 #INDS 

INDS¢, O 1 4INDS 

A Exit if no indices: 

»*(xXNREC)J1L43 

LAY¢FP[1] 

DISP¢FP(4] 

BLK¢FP(5] 

A Read LV, RPS, ARPS if layer fld being assigned: 
»>*(LAYE€FLDS)1J1L4 

LV¢OFREAD TIE,9 

RPS¢ARPS¢UFREAD TIE,8 

+(=/FPL7 10])eL4 

ARPS¢OFREAD TIE,10 

a Determine distinct set numbers (deleting “1s): 
L4:UNQ¢SETSLASETS ] 

NSET¢oUNQ€ CUNQ# “11 °71,UNQ)/UNQ 

a Last set index: 

SIND¢€0O 

A Loop by distinct set: 
L5:29C(SIND2NSET) oL43 

Aa Current set index and number: 

SIND¢SIND+1 

SET¢UNQLSIND] 


number and width: 


record: 
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V INDEXWSA Ccontinued) 


[76] A Displacement (no. components) before this set: 


C77] 
[78] 
[79] 
[80] 
[81] 
C82] 
C83] 
[84] 


SDISP¢+DISP+INCRXSET+ 1 


© Indices of INDS to retrieve in this set: 


IND¢(SET=SETS)/tNREC 
I¢INDSCIND] 


A Elts of INDS of recs whose layer value changes: 


BAD€10 


a Last field index: 


F<0 


C85] 
[86] 
C87] 


A Loop by specified field: 
L6:%(F2NFLD)eL13 
A Current field index, number and width: 


[88] 
[89] 
[90] 
[91] 
[92] 
[93] 
[94] 
[95] 
[96] 
[97] 
[98] 


[99] A Branch if 'singleton’ 


[100] 
C101] 
C102] 
[103] 
[104] 
C105] 
[106] 
[107] 
[108] 
[109] 
C110] 
[111] 
C112] 
C113] 
C114] 
[115] 
[116] 
[117] 
C118] 
C119] 
C120] 
C121] 
C122] 
[123] 
C124] 
C125] 
[126] 
[127] 


FeF+] 
FLD¢FLDSCEF] 


A Is this the layer field being changed? 


LAYER¢LAY=FLD 
WeWIDC FLD] 


RA Read data: 


DATACOFREAD CMP¢TIE,FLD+SDISP 


Aa Current field var: 


VAR¢#’'F’ ,éFLD 


Aa Continue if singleton: 


¥(1A.=o0VAR)eL10 
record: 
9CCWALIAIA. =" 1LoVAR) eo L7 

Aa Branch if rank OK already: 
>( Coe VAR) =2LW)eL8 
L7: VAR¢ (NREC, (W>1) oW) eVAR 

A Branch if matrix: 
L8:>(W>1)0L9 

VARCVARL IND] 

>L10 
L9: VAR¢VAR[ IND; ] 

a Branch if matrix: 
L10:2(W>1)eL11 

A Insert data: 

DATALTJ]¢VAR 

Aa Branch unless layer field: 
®9LAYERJL12 

A Flag recs whose layer val has 
BAD¢CLVLSETI#4VAR)/TI 

9L12 

Lil: DATALI; J¢VAR 

A Branch unless layer field: 
»LAYER!JL12 

A Flag recs whose layer val has changed: 
BAD¢CVARV.#LVLSET;1])/1 

Aa Replace data on file: 

L12:DATA UFREPLACE CMP 

>L6 

A Next set if layer values unchanged: 
L13:3%CxpBAD)IL5 


changed: 
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V 
C128] 
C129] 
[130] 
C131) 
C132] 
[133] 
[134] 
[135] 
[136] 
C137] 
[138] 
[139] 
C140] 
C141] 
[142] 
[143] 
[144] 
[145] 
[146] 
C147] 
£148] 
[149] 
[150] 
C151] 
C152] 
C153] 
£154] 
C155] 
C156] 
C157] 
C158] 
£159] 
[160] 
[161] 
[162] 
£163] 
C164] 
C165] 
[166] 
[167] 
C168] 
[169] 
C170] 
C171) 
C172] 
C173] 
C174] 
C175] 
C176] 
C177] 
[178] 
C179] 
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INDEXWSA (continued) 
A Indices of active fields: 

AFLD¢€(xXxWID)/LINCR 
A Exclude deletion field: 

NF¢oAFLD¢ C(AFLD4DEL) /AFLD 

A Flag records filed so far: 

FILED¢( oBAD) 90 

a Look at layer field: 

LVAR¢COFREAD TIE, LAY+SDISP 

A Branch if vector field: 

VECeEWIDE LAY jJ=1 

L14:9VECeL1L5 

a Flag records with layer of 1st unfiled rec: 
LAYER¢LVARL[ BADE FILEDtO]; ] 

GOOD¢ LVAR[ BAD; JA. =LAYER 

A ...and sets with this record: 

LSETS€¢LVA. =LAYER 

>L16 

L15: LAYER¢LVARE BAD[LFILEDtO] ] 
GOOD¢LVARI BAD] =LAYER 

LSETS€LV=LAYER 

A Update FILED; convert to indices: 

L16: FILED¢FILEDVGOOD 

GOOD¢GOOD/ BAD 

NR¢oeGOOD 

A Consider only non-full sets in this layer or 
A empty sets in any layer: 

LSETS¢(CARPS=0) VLSETSAARPS # BLK 

Aa Convert to indices: 

LSETS¢LSETS/toeLSETS 

A No. records filed so far: 

FREC€¢0 

Aa Branch if no slots available in existing sets: 
L17: > (CBLK=MIN¢L /A¢lARPSCLSETS])0L32 

A Branch if more than 1 set needed: 

T¢NR-FREC 

+(T>BLK-MIN) eL18 

A Choose fullest set which will hold all recs: 
S€¢BLK-A 

S€¢LSETSCStiL/CS2T)/S] 

9L19 

A Choose set with most empty slots: 
L18:S€¢LSETS[LAUMIN J 

A No. records to be inserted within the set: 
L19:M¢TLRPSESJI-!ARPSCS] 

A No. records to be catenated within the set: 
N¢(CT-M)LBLK-RPSES] 
A Displacement (no. 
SD¢DISP+INCRxXS+ 1 
A Branch if no deletion field: 
»*(xDEL)ILL20 

A Read deletion field for set S: 
BIT¢OFREAD DCMP¢TIE, DEL+SD 


components) before this set: 
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INDEXWSA (continued) 
A Branch if no records to be inserted: 
»*(xM)1L20 
A Indices of available insertion slots: 
IINDS¢Mo (~BIT)/teBIT 
A Next field index: 
L20: Fel 
A Loop by active field: 
L21:>(F>NF)oeL26 
A Field number: 
FLD¢AFLD[IF] 
A Field width (no. 
W€WIDLE FLD] 
A Look at original set: 
VAR¢DFREAD TIE, FLD+SDISP 
A Read field F for set S: 
DATA¢DFREAD CMP¢TIE,FLD+SD 
a Branch if a matrix field: 
+(W>1)eL23 
A Branch if no records to insert: 
9(xM)1L22 
DATAL ITINDS ]¢VAR[LGOODL[ FREC+iM] J 
a Branch if no records to catenate: 
9(xXN)LJL25 
L22: DATA¢DATA, VARI GOODE (CFREC+M)+1N] J 
9L25 
A Branch if no records to insert: 
L23:9C€xXM)1lL24 
DATACT ITINDS ; JEVARL GOODE FREC+iMI13] 
Aa Branch if no records to catenate: 
(XN) LL25 
L24: DATA©DATA, [1JVARE GOODE (FREC+M)+ iN]; ] 
L25:DATA OFREPLACE CMP 
FeFt] 
>L21 
A Branch if no deletion field: 
L26: LEADe1 
*(xDEL)JL29 
A Branch if no records to insert: 
>*(xM)LL27 
a Turn active record bits on: 
BITCIINDS]€1 
LEAD¢A/BIT=A\BIT 
@ Branch if no records to catenate: 
2*(xXN)JL28 
L27:BIT¢BIT,Nol 
L28: BIT OFREPLACE DCMP 
A Increment FREC by no. 
L29: FREC¢€FREC+M+N 
RPSCSJ]€RPSCSI+N 
NFPLOYJ¢NFPL91+T¢O=ARPSES] 
A Replace layer value if set initially empty: 
®9TIL31 


columns): 


records added to this 
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[254] 
[255] 
[256] 
C257] 
C258] 
[259] 
[260] 
[261] 
[262] 
[263] 
[264] 
[265] 
[266] 
[267] 
[268] 
[269] 
[270] 
C271] 
C272] 
[273] 
C274] 
[275] 
[276] 
[277] 
C278] 
[279] 
C280] 
C281] 
[282] 
C283] 


FILE 


INDEXWSA (continued) 

A Branch if a vector layer field: 
®VECeL30 

LVUCS;]¢€LAYER 

9031 
L30: LVIS]J¢LAYER 
L31:ARPS(CSJ€C 7-1 1)£1+LEAD]XM+N+I1ARPS[S] 
NFPL7J¢NFPE7]+N 

A Exit if all of data in field vars. 
+(NR=FREC) eL40 

9L17 

A No. records to be appended in next set: 
L32:N¢BLKLNR-FREC 

A Next field number: 

Fel 

A Loop by field: 
L33:2(F>INCR)oL39 

WEWIDEIF] 

A Branch unless a latent field: 
>(xW)0L34 

DATA€tO 

%L38 

A Branch unless it's the deletion field: 
L34:%9(DEL#FF)eL35 

DATA¢Noe1 

937 

a Look at original set: 
L35: VAR¢OFREAD TIE,F+SDISP 

A Branch if a matrix field: 

>CW>1) eL36 

DATA¢VAR[I GOOD[ FREC+iN] ] 

»°L37 

L36: DATA¢ VARI GOOD[ FREC+iN]J 3] 


DESIGN AND 


filed: 


UTILITIES 


A Branch unless set must be padded to BLK records: 


L37:°FILLJL38 

DATA (BLK, 1/oDATA) TDATA 
L38:T¢DATA OFAPPEND TIE 
FeF+1 

»L33 

A Increment FREC by no. 
L39: FREC¢FREC+N 
ARPS¢ARPS ,N 
RPS¢RPS , TENF BLKXFILL 
NFPL7IJ¢NFPC71+T 
NFPC6J¢NFP(6]+1 
NFPLOJ¢NFPL9J]+1 

A Catenate layer value: 
LV¢LV,C1] LAYER 


A Continue unless all of data in field vars. 


> (NR#FREC) 0L32 

A Branch if no data left to file: 
L40:9CA/FILED)JJL14 

A 
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V 
C284] 
[285] 
[286] 
[287] 
[288] 
C289] 
[290] 
C291] 
[292] 
[293] 
[294] 
[295] 
[296] 
[297] 
[298] 
[299] 
[300] 
C301] 
C302] 
C303] 
C304] 
C305] 
[306] 
[307] 
[308] 
[309] 
C310] 
ae Es Ee 
C312] 
C313] 
{314] 
[315] 
[316] 
[317] 
[318] 
[319] 
C320] 
C321] 
[322] 
C323] 
[324] 
(325] 
[326] 
[327] 
[328] 
[329] 
C330] 

Vv 


INDEXWSA (continued) 

Aa Branch unless deletion field exists: 
9(xDEL)JL41 

8 Read deletion field for set SET: 
BIT¢OUFREAD CMP¢TIE,DEL+SDISP 

a Turn off specified indices and replace: 
BITC BAD]€0O 
BIT OFREPLACE CMP 

A Compute new no. 
N¢e+/BiT 
M¢!|RPSCSET] 

A Reset parameters: 
NFPLO9J¢NFPLO9IJ-N=0 
ARPS[TSETI€¢NXC71 1L)IC1+N=4+/A\BITI 
>L5 

A 

A Turn off specified indices: 

L41:M¢RPS(CSET]J 
BIT¢Mo1 
BITE BAD1]<0O 

A Compute new no. 
N¢+/BIT 

A Reset parameters: 
NFPC9OI¢NFPL9]-N=0 
NFP(C71¢NFP(7J+N-M 
RPSCSETI€¢N 

Aa Last field index: 

F¢O 

A Loop by active field: 

L42:9(F2NF)oL5 

Aa Current field index and number: 
FeF+1 
FLD¢AFLDIF] 

A Read, 
CMP¢+TIE, FLD+SDISP 
CBITAOFREAD CMP)JOFREPLACE CMP 
9L42 

A 

A Erase global field variables: 

L43:F€¢s(NFLD,1)oFLDS 
FCOEX 'F!’',C+/' '=F)OF 


records: 


records: 


A Replace LV, RPS, ARPS if layer field assigned: 


> (LAYEFLDS)10 

LV OFREPLACE TIE,9 
NFP OFREPLACE TIE,7 
RPS OFREPLACE TIE,8 
>(XDELJ JO 

ARPS OFREPLACE TIE,10 
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compress, replace field FLD for set SET: 


Ci] 
C2] 
C3] 
[4] 
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V 
A 
A 
A 


V 


Vv 


SDDvdDDDDIIDIOAIDIIDVIDID VID ITD VPIVIDDVDIIDOND 


A 


A 
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CWSID: MULTIFLO] 
R¢A FOR B 


Catenates arguments together, separating by a 
newline character. Used as: 
(FP,°7 7 2 4 0 3) EXECUTE 'F7¢F7/F2+F4' FOR 'F3>1' 
R¢A,OUTCNL,B 
C[WSID: MULTIFLO] 


AANFP€AAFP EXECUTE AAEXPR3; AAA; AAAFLDS ; AAALL; AAARPS 5 
AABAD; AABIT 3; AABLK$ AACMP 3; AADATA ; AADCMP 5; AADEL; AADISP3$ AAF 
; AAFILED; AAFILL; AAFLD;3 AAFLDS 3; AAFN1 3 AAFN2 3; AAFN3 3 AAFREC; 
AAFT 3; AAGOOD 3 AATINDS 3; AAINCR$ AAIND; AAINDS ; AALAB; AALAY 3; 
AALAYER; AALEAD; AALSETS ; AALV; AALVAR; AAM; AAMIN ; AAN; 
AANAFLD; AANDEL; AANF 3; AANFLD3; AANR3 AANSET $; AANSFLD3 AARPS 3 
AAS 3; AASD3;AASDISP3 AASEL3 AASET 3 AASETS 3; AASF LDS 3 AASIND3;3 AAT 
sAATIE3: AAVAR; AAVEC 3 AAW; AAWID; AAXEQ; 010 
Used as: (FP, 7 7 2 4) EXECUTE ‘'F7¢F7/[F2+F4’ or: 
(FP,°7 7 2 4 0 3) EXECUTE ‘'F7¢F7/F2+F4’' FOR 'F3>1’ 
Loops through active sets doing the following: 
reads the fields specified beyond the 0 in the 
left argument (calling them F3, F5, etc. for 
fields 3, 5, etc.); executes the character vector 
expression beyond the newline character (inserted 
by FOR) in the right argument; retrieves data 
from fields specified before the 0 (Cand positive) 
in the left argument (calling them F7, F2, etc. 
for fields 7, 2, etc.) for the records of that 
set which correspond to is in the resulting bit 
vector (or all active records if no selection 
expression provided); executes the character 
vector expression before the newline character in 
the right argument; replaces on file data for 
fields specified before the 0 (Cand negative) in 
the left argument (assuming their existence in 
variables F7, F8, etc. for fields “7, “8, etc.) 
for the records selected. All "Fn" variables are 
erased upon completion. Note that EXPR is 
executed in origin 1; e.g. 'F3[;21' always refers 
to 2nd column. 
DERROR( (1#peAAFP) V1<p9AAEXPR)/'RANK ERROR’ 
OERRORCO=1TOPAAEXPR)/'DOMAIN ERROR’ 
OLTO¢+1 
Extract 3 sets of fields from FP: 
AATE11+ C1 LIAAFP)LO 
AAFLDS€111 (AAT-1) 0AAFP 
AASFLDS¢AATIAAFP 
AAAFLDS¢€ | CAATCAAFLDS<0)/AAFLDS 
AAFLDS¢(~AAT) /AAFLDS 
Extract 2 expressions from EXPR: 
AAEXPRE , AAEXPR 
AAT¢AAEXPRLOTCNL 
AAXEQ¢ ( AAT-1) pAAEXPR 
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[37] AAEXPREAATLAAEXPR 

[38] DERROR(CCO=eAAFLDSIVCLIIZGAAFPIVOC’ 'A.=AAEXPR)#O=0 
AASFLDS)/'LENGTH ERROR’ 

C39] AAINCREAAFPL3 J] 

[40] AADEL¢€!AAFP(11] 

[41] AAFILL€AAFP[L11]<0 

[42] AAT¢AAFLDS, AAAFLDS , AASFLDS 

[43] OERROR( CA/AATELAAINCR) SAADELEAAT)/’' INVALID FIELD 
NUMBER’ 

[44] AATIE€AAFPLE2] 

[45] AAFT€OFREAD AATIE,4 

[46] A Width (no. columns) of fields: 

[47] AAWID€AAFT(13 ] 

[48] DERRORCOEAAWIDIAATI)/'INACTIVE FIELD’ 

[49] A Numbers of sets with records in wrong layer: 

[50] AABAD€10 

[51] A Exit if no records: 

[52] AANFP¢11eAAFP 

[53] 7%CXAAFPDCLIO]ILAAL2Z9 

[54] a Make field vectors distinct: 

[55] AATCAAINCREO 

[56] AATLAASFLDS]€1 

[57] AANSFLD<pAASFLDSEAAT/ LOAAT 

[58] AATL J€0 

[59] AATEAAAFLDS 1€1 

[60] AANAFLD<epAAAFLDS€CAAT/ LU OAAT 

C61] AATLI€<0 

[62] AATLAAFLDS J€1 

[63] AANFLD¢pAAFLDS€AAT/ LOAAT 

[64] A Names of fields to erase: 

C65] AAFN1€¢C~AASFLDSEAAFLDS ) /AASFLDS 

[66] AAFN1¢€6( CeEAAFN1) , 1) 0AAFN1 

[67] AAFNI€'F! ,C+/'’ '=AAFN1)DAAFNI 

[68] AAFN2€(C~AAFLDSEAAAFLDS)/AAFLDS 

C69] AAFN2¢€8(C COAAFN2) ,1)  0AAFN2 

[70] AAFN2€'F’ ,C+/' '=AAFN2)PAAFN2 

[71] AAFN3¢€6 CAANAFLD,1) 0AAAFLDS 

[72] AAFN3€/F’ ,C+/'’ '=AAFN3 )PAAFN3 

[73] A Label vector needed below based upon whether 

[74] a field in ws or on file when needed (file,ws): 

[75] AALAB€ CAAL1I7 ,AAL19)[1+AAFLDSEAASFLDS ] 

C76] AALAY¢AAFPI 1] 

[77] AADISP¢AAFPL[4] 

[78] AABLK¢AAFPLE5] 

[79] AAARPS¢€AARPS©CUFREAD AATIE,8 

C80] A Branch unless ARPS should be read: 

[81]  2%C=/AAFPI7 101) eAAL1 

[82] AAARPS€0O0FREAD AATIE,10 

[83] mA Consider only nonempty sets: 

[84] AAL1: AASETS€¢O0#AAARPS 

[85] a ...and sets specified in <layers>: 

[86] > CXAALAY)LAAL5 

[87] AAT¢xONC ‘layers’ 
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V EXECUTE (Ccontinued) 


[88] 

[89] 

[90] 

[91] 

[92] 

[93] 

[94] 

C95] 

[96] 

[97] 

[98] 

[99] 

[100] 
[101] 
[102] 
C103] 
[104] 
C105] 
C106] 
C107] 
C108] 
[109] 
[110] 
C111] 
[112] 
C113] 
C114] 
C115] 
C116] 
C117] 
[118] 
[119] 
[120] 
C121) 
C122] 
[123] 
C124] 
C125] 
[126] 
C127) 
[128] 
C129] 
C130] 
C131] 
C132] 
C133] 
[134] 
[135] 
C136] 
[137] 
C138] 
[139] 
C140] 


+ (AAT¥AALAY€AAAFLDS ) pAALS 


A Read layer values: 


AALV¢OFREAD AATIE,9 
~AATLAALS 


A Branch if matrix layers field: 


»*CAAWIDEAALAY1]>1) eAAL2 
AASETS€AASETSAAALVE layers 
2AALS5 


a Convert <layers> to matrix if not already: 
AAL2: O0ERRORCAAFT[13;AALAY]# 1Tolayers)/'LENGTH ERROR’ 


>(2=eplayers) eAAL3 


layers¢((x/"llolayers), “1Telayers)Jolayers 


A Branch if numeric matrix field: 
AAL3:>CO=1TOPAALV) CPAAL4 


AASETS©AASETSA(Clayers CMIOTA AALV)<leelayers 


2AALS5 
AAL4: AASETS¢€AASETSAV/AALVA. =Rlayers 
A Convert to indices; erase <layers>: 
AALS: AASETS©¢AASETS/ LP AASETS 
AANSET¢pAASETS 

AAT¢CDEX ‘layers’ 

A Last set index: 

AAS€O 

A Loop by nonempty set: 
AAL6:>CAASZAANSET) pAAL29 

Aa Current set index and number: 
AASfAAS+t1 
AASET¢CAASETSLAAS] 
A Displacement (Cno. 
AASDISP€EAADISP+AAINCRXAASET-1 

Aa Branch if deletion field unneeded: 
7+ CAANDELCAAARPS[ AASET]>0) pAAL7 

A Read deletion field for set SET: 
AABIT¢OFREAD AATIE,AADEL+AASDISP 

A Are all records in this set active: 
AAL7: AAALL€EAAARPS[ AASET]=AARPS[AASET] 
Aa Branch if a selection expression: 

+ ( XAANSFLD) eAALO 

A Branch if all records active: 
AAALLEAAL15 

A Branch if deletion field unneeded: 
> AANDELeP AALS 

A Indices of active records: 
AAINDS€AABIT/LPAABIT 

2AALI5 
AAL8&: AAINDS©€LAAARPS[ AASET] 

2AAL1I5 
A Last field index: 
AALS: AAF€O 

A Loop by selection field: 
AAL10: >CAAFZ2AANSFLD) PAAL13 

A Current field index and number: 
AAFCAAFtH1 
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EXECUTE Ccontinued) 


AAFLD€AASFLDSCAAF] 

A Read field FLD for set SET: 
AADATACOFREAD AATIE,AAFLD+AASDISP 

a Branch if all records in set active: 
+AAALLEAAL12 

Aa Branch if deletion field unneeded: 
AANDELeAALI1 

A Apply deletion field: 
AADATA€AABLITZAADATA 

~AAL12 

A Active records are leading records: 


AAL11: AADATA€ ( AAARPSL[ AASET1], 11) 0AADATA) PAADATA 
A Assign data to global variable Fn: 
AAL12:¢@'F’' ,C@AAFLD) , '’€AADATA’ 
%AAL10 
A Once all selection fields read, 
AAL13? AASELEPAAEXPR 
A Erase fields in SFLDS and not in FLDS: 
AAT€COEX AAFN1 
Aa Indices to retrieve (after squeezing deletions): 
AAIND©AASEL/ LO AASEL 
A Branch if some records selected: 
>CXpAAIND) eAALI14 
A Erase field variables; 
AAT¢COEX AAFN2 
AAT¢OEX AAFN3 
2>AAL6 
A Branch if all records on file selected: 
AAL14:%( AAALLECAAALLA ( pAAIND) =9AASEL) PAAL15 
A Indices to retrieve (before squeezing deletions): 
AAINDS¢AAIND 
aA Branch if deletion field unneeded: 
>AANDELeEAALI5 
A Reset INDS, considering deletion field: 
AAINDS€ (AABIT/ UP AABIT) [AAIND] 
Aa Last field index: 
AAL15: AAF€O 
A Loop by execution field: 
AAL16: > CAAFZ2AANFLD) pAAL22 
A Current field index and number: 
AAF©CAAF+1 
AAFLD¢CAAFLDS[LAAF] 
a Branch depending on whether already in ws: 
>AALABL AAF] 
AAL17: AADATACOFREAD AATIE, AAFLD+AASDISP 
>AAALLOAAL21 
A Branch if matrix field: 
>(CAAWID[AAFLDI1>1)eAAL18 
AADATA¢AADATAT AAINDS ] 
XAAL21 
AAL18: AADATACAADATAL AAINDS: J] 
~AAL2Z1 
AAL19: ®#AAALLOAAL1I6 


execute EXPR: 


get next set: 
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Vv 
£194] 
[195] 
C196] 
[197] 
C198] 
[199] 
C200] 
[201] 
[202] 
[203] 
[204] 
[205] 
[206] 
[207] 
[208] 
[209] 
[210] 
[211] 
[212] 
C213] 
[214] 
[215] 
C216] 
[217] 
[218] 
C219] 
[220] 
[221] 
[222] 
[223] 
[224] 
[225] 
C226] 
[227] 
[228] 
C229] 


[230] 
C231] 
C232] 
C233] 
[234] 
[235] 
[236] 
[237] 
[238] 
[239] 
[240] 
C241] 
[242] 
[243] 
[244] 
[245] 


EXECUTE (continued) 
AADATA€?!'F! ,dAAFLD 

A Branch if matrix field: 
>*CAAWIDLAAFLDI]>1) eAAL20 
AADATACAADATALAAIND] 
IAAL21 

AAL20: AADATAfCAADATAL AAIND; ] 


A Assign data to global variable Fn: 


AAL21:¢@'F!' ,CBAAFLD) ,'¢AADATA’ 
%AALI6 


FILE DESIGN AND 


UTILITIES 


A Once all execution fields read, execute XEQ: 


AAL22: SAAXEQ 


A Erase fields in FLDS and not in AFLDS: 


AAT¢COEX AAFN2 

a Last field index: 

AAF€0 

A Loop by assignment field: 
AAL23:9(AAFZAANAFLD) pAAL28 


A Current field index, number and width: 


AAFCAAFtH+1 
AAFLD¢CAAAFLDSLE AAF] 
AAW¢AAWID[IAAFLD] 
AACMP¢AATIE, AAFLD+AASDISP 
AADATA€#@'F!' ,SAAFLD 

A Branch if all records selected: 
SAAALLGAAL2Z5 

AAVAR¢CAADATA 

A Read field from file: 
AADATA¢COFREAD AACMP 

A Branch if matrix field: 
+(AAW>1) pAAL24 

AADATAL AAINDS ]©€AAVAR 
2AAL25 
AAL24: AADATAL AAINDS ; ]€AAVAR 


a Check that data is OK before filing: 
AAL25: DERROR( (2LAAW) #09AADATA)/'RANK ERROR’ 


DERROR( (AAW# 111,11 eAADATA) VAARPS[ AASET]#109AADATA) /' 


LENGTH ERROR’ 


OQERROR(C2#AAFT£2 3; AAFLD])#0=1TOPAADATA)/'DOMAIN ERROR’ 


A Replace data on file: 

AADATA DFREPLACE AACMP 

a Continue loop unless layer field: 
> (AALAYZ#AAFLD) pAAL23 
®AAALLLAAL2Z6 

AAVARCAADATA 

A Branch if matrix field: 
AAL26:>(AAW>1) pPAAL27 

Aa Continue if layer values ok: 
> CAAVARA. =AALVEAASET] ) pPAAL23 
A Else keep track of bad set: 
AABAD©¢AABAD, AASET 

2AAL2Z3 


AAL2Z7:32CA/AAVARA. =AALVIAASET; 1) 0AAL23 


AABADCAABAD, AASET 
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[246] 
[247] 
[248] 
[249] 
C250] 
[251] 
[252] 
[253] 
[254] 
[255] 
[256] 
C257] 
[258] 
[259] 
[260] 
[261] 
[262] 
[263] 
[264] 
[265] 
[266] 
[267] 
[268] 
[269] 
[270] 
C271] 
[272] 
C273] 
[274] 
[275] 
[276] 
[277] 
C278] 
[279] 
[280] 
C281] 
[282] 
[283] 
[284] 
[285] 
[286] 
C287] 
[288] 
[289] 
[290] 
[291] 
[292] 
[293] 
[294] 
[295] 
[296] 
[297] 


FILE DESIGN AND UTILITIES 


EXECUTE (continued) 

2AAL23 

a Erase fields in AFLDS: 

AAL2Z8: AAT€COEX AAFN3 

PAALG 

AAL29: AAT¢CDEX ‘layers’ 

a Exit if no sets with wrong layer recs: 

+ ( XAANSET¢9AABAD) 10 

AASETS¢AABAD 

A Indices of active fields: 

AAAFLDS€C XAAWILD) /LAAINCR 

A Exclude deletion field: 

AANF¢ oe AAAFLDS€ (AAAFLDS#ZAADEL) /AAAFLDS 

Aa Is layer field a vector field? 
AAVEC€AAWIDE AALAY ]=1 

A Last set index: 

AASIND€O 

A Loop by bad set: 
AAL30:%(AASIND2AANSET) pAAL64 

A Current set index and number: 
AASIND€AASIND+1 

AASET€CAASETSLAASIND] 

A Displacement (no. components) before this set: 
AASDISP©AADISP+AAINCRXAASET~-1 

a Branch if deletion field unneeded: 
> (AANDELCAAARPS[AASETI>0) 0AAL31 

A Read deletion field for set SET: 
AABIT¢OUFREAD AATIE, AADEL+AASDISP 

A Indices of active records: 
AAINDS€AABIT/UPAABIT 

PAAL32 

AAL31: AAINDS€LAAARPS[ AASET ] 

A Look at layer field: 

AAL32: AALVAR¢QOFREAD AATIE, AALAY+AASDISP 

A Branch if vector field: 
PAAVECPAAL33 

a Indices of recs with changed layer value: 
AABAD€ CAALVAR[ AAINDS 3; ]V.#AALVEI AASET;: 1)/AAINDS 
%AAL34 

AAL33: AABAD¢ CAALVAR[ AAINDS ] #AALVE AASET1)/AAINDS 
A Flag records filed so far: 

AAL34: AAFILED¢( pAABAD) 90 

A Branch if vector field: 

AAL3 5: 9AAVECPAAL3 6 

A Flag records with layer of lst unfiled rec: 
AALAYERCAALVAR[ AABADE AAFILED1U0] 31] 
AAGOOD€AALVAR[E AABAD; JA.=AALAYER 


a ...and sets with this record: 
AALSETS€AALVA. =AALAYER 
%AAL37 


AAL36: AALAYERCAALVAR[ AABADLAAFILEDi0] ] 
AAGOOD¢AALVAR[ AABADI=AALAYER 
AALSETS¢AALV=AALAYER 
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[298] 
[299] 
C300] 
C301] 
C302] 
[303] 
[304] 
C305] 
C306] 
C307] 
C308] 
C309] 
C310] 
[311] 
C312] 
[313] 
C314] 
C315] 
C316] 
C317] 
[318] 
[319] 
[320] 
[321] 
[322] 
C323] 
[324] 
C325] 
C326] 
[327] 
[328] 
[329] 
[330] 
(331) 
C332] 
[333] 
C334] 
[335] 
C336] 
[337] 
[338] 
[339] 
C340] 
[341] 
[342] 
C343] 
[344] 
[345] 
[346] 
[347] 
[348] 
[349] 
C350] 


FILE DESIGN AND UTILITIES 


EXECUTE (continued) 
a Update FILED; convert to indices: 
AAL37: AAFILED¢AAFILEDVAAGOOD 
AAGOOD€AAGOOD/ AABAD 
AANR¢p AAGOOD 
a Consider only non-full sets in this layer or 
A empty sets in any layer: 
AALSETS€( AAARPS=0) VAALSETSAAAARPS#ZAABLEK 
A Convert to indices: 
AALSETS©AALSETS/UPAALSETS 
A No. records filed so far: 
AAFREC¢O 
a Branch if no slots available in existing sets: 
AAL38:>CAABLKZAAMINEL /AAAf | AAARPS[E AALSETS ]) 0 AAL53 
a Branch if more than 1 set needed: 
AAT¢AANR-AAFREC 
> ( AAT>AABLK-AAMIN) eAAL39 
A Choose fullest set which will hold all recs: 
AAS€AABLK-AAA 
AAS€AALSETSLAASLUL/ CAASZAAT)/AAS] 
»>AAL40 
a Choose set with most empty slots: 
AAL39: AAS€CAALSETS [AAALAAMIN] 
A No. records to be inserted within the set: 
AAL40: AAMEAATLAARPS[E AAS 1-1 AAARPS[E AAS ] 
A No. records to be catenated within the set: 
AAN€ CAAT-AAM) LAABLK-AARPSLAAS] 
A Displacement (no. components) before this set: 
AASD€AADISPt+AAINCRXAAS+ 1 
a Branch if no deletion field: 
>CXAADEL)JJAAL41 
A Read deletion field for set S: 
AABIT€ONFREAD AADCMP€¢AATIE, AADEL+AASD 
Aa Branch if no records to be inserted: 
»>(XAAM)LAAL41 
A Indices of available insertion slots: 
AALINDS€AAMp (~AABIT) /LQAABIT 
A Next field index: 
AAL41: AAF€1 
A Loop by active field: 
AAL42:9(AAF>AANF) 0AAL47 
A Field number: 
AAFLD¢€AAAFLDSLAAF ] 
A Field width (no. 
AAWCAAWIDELAAFLD] 
A Look at original set: 
AAVAR¢COFREAD AATIE,AAFLD+AASDISP 
A Read field F for set S: 
AADATACO FREAD AACMP€AATIE, AAFLD+AASD 
a Branch if a matrix field: 
+(AAW>1) 0AAL44 
a Branch if no records to insert: 
>(XAAM) LAAL43 
AADATACAATINDS ]J€AAVAR[I AAGOODE AAFREC+t+LAAMI] 


columns): 
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V 
C351] 
[352] 
C353] 
[354] 
[355] 
[356] 
C357] 
C358] 
[359] 
[360] 


[361] 
C362] 
[363] 
[364] 
[365] 
[366] 
[367] 
[368] 
[369] 
[370] 
C371] 
[372] 
C373] 
[374] 
C375] 
[376] 
C377] 
[378] 
[379] 
C380] 
[381] 
C382] 
[383] 
C384] 
[385] 
C386] 
[387] 


[388] 
C389] 
[390] 
[391] 
[392] 
[393] 
[394] 
[395] 
[396] 
C397] 
[398] 
[399] 
C400] 
C401] 
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EXECUTE (continued) 

A Branch if no records to catenate: 

XC XAANIJLAAL46 

AAL43: AADATACAADATA, AAVAR[E AAGOODE CAAFREC+AAM)+1ULAAN] ] 
X~AAL46 

Aa Branch if no records to insert: 

AAL44:>(XAAM) LAAL45 

AADATALT AAIINDS 3; 1©€AAVARE AAGOODL AAFREC+lLAAM]I 3; ] 

A Branch if no records to catenate: 

>(XAAN)JJAAL46 

AAL45: AADATA©CAADATA, [IL JAAVARI AAGOODL[ CAAFREC+AAM)+1UAAN ] 
- J 
AAL46: AADATA OFREPLACE AACMP 

AAFeAAF+H+1 

»AAL42 

Aa Branch if no deletion field: 
AAL47: AALEAD€1 

>(XAADEL)JLAAL5O 

A Branch if no records to insert: 

>(XAAM) LAAL48 

A Turn active record bits on: 

AABITLAATINDS ]€1 

AALEAD€A/ AABIT=A\AABIT 

A Branch if no records to catenate: 

>(XAAN) LAAL49 

AAL48: AABITCAABIT, AANO1 

AAL49: AABLIT OFREPLACE AADCMP 

A Increment FREC by no. records added to this set: 
AAL50: AAFRECEAAFREC+AAM+AAN 

AARPS[ AAS 1]©¢AARPS[AAS]+AAN 

AANFPL9 J€AANFPL91+AAT¢O=AAARPSL AAS I 

A Replace layer value if set initially empty: 
PAATLAAL5S2 

A Branch if a vector layer field: 

SAAVECPAAL5S 1 

AALVLAAS; J€AALAYER 

PAAL52 

AAL51: AALVL AAS JEAALAYER 

AAI52: AAARPSTAASJ]€(0C 71 1) 01+AALEAD] XAAM+AAN+ | AAARPSEI AAS 
] 

AANFPL7J¢AANFPL71+AAN 

a Exit if all of data in field vars. 
+ ( AANR=AAFREC) epAAL61 

PAAL38 

A No. records to be appended in next set: 
AAL53: AAN€CAABLKL AANR-AAFREC 

A Next field number: 

AAFel] 

A Loop by field: 
AAL54:7%(AAF>AAINCR) PAAL60 
AAWCAAWIDLAAF J 

A Branch unless a latent field: 
>(CXAAW) pAAL5S5 

AADATA€10 


filed: 
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V 
[402] 
[403] 
C404] 
[405] 
[406] 
C407] 
C408] 
[409] 
C410] 
[411] 
C412] 
C413] 
C414] 
[415] 
C416] 
C417] 
[418] 
C419] 
C420] 
C421] 
[422] 
C423] 
[424] 
C425] 
C426] 
[427] 
C428] 
[429] 
[430] 
C431] 
[432] 
C433] 
C434] 
C435] 
C436] 
C437] 
C438] 
[439] 
[440] 
[441] 
[442] 
[443] 
[444] 
[445] 
[446] 
[447] 
C448] 
[449] 
[450] 
[451] 
[452] 
[453] 


EXECUTE (continued) 

PAAL5I 

A Branch unless it’s the deletion field: 

AAL55:>( AADELZAAF) PAALS6 
AADATA¢AANo 1 
2AAL58 

A Look at original set: 

AAL56: AAVARCOFREAD AATIE,AAF+AASDISP 

A Branch if a matrix field: 
>CAAW>1) epAALS7 
AADATACAAVARE AAGOODL AAFREC+1lLAAN] J 
2AAL5S8 

AAL57: AADATACAAVARL[ AAGOODL[ AAFREC+H+LAAN]J 3 J 

A Branch unless set must be padded to BLK records: 

AAL58: 2AAFILLIAALSYS 
AADATA¢ CAABLK, 1/0 AADATA) TAADATA 

AAL59: AATCAADATA OFAPPEND AATIE 
AAFCAAF+1 
%AAL5S4 

A Increment FREC by no. 

AAL60: AAFRECCAAFRECt+AAN 
AAARPS¢AAARPS , AAN 
AARPS¢AARPS , AATCAANT AABLKXAAFILL 
AANFPLE7 J€AANFP[7]+AAT 
AANFP[L6]€AANFP[6]+1 
AANFPL9J©€AANFP[9]+1 

A Catenate layer value: 

AALV¢AALV, CLJAALAYER 

A Continue unless all of data in field vars. 
+ ( AANR#AAFREC) 0AAL53 

A Branch if no data left to file: 

AALO61:9CA/AAFILED) LAAL35 

A 

A Branch unless deletion field exists: 
+(XAADEL) JAAL62 
AABIT¢OFREAD AACMP¢AATIE, AADEL+AASDISP 

A Turn off specified indices and replace: 
AABITCAABADJ€0 
AABIT OFREPLACE AACMP 

A Compute new no. records: 

AAN¢+/AABIT 
AAM©¢ I AARPSTAASET] 

A Reset parameters: 
AANFPL9]¢AANFP[9]-AAN=0 
AAARPS[AASETI€AANXC 71 1)£1+AAN=+/A\AABIT] 
»AAL30 

A 

a Turn off specified indices: 

AAL62: AAMCAARPS[ AASET] 

AABIT¢AAMo1 
AABITL[ AABADJ€0 

A Compute new no. 

AAN€+/AABIT 


records added to this set: 


filed: 


records: 
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VY EXECUTE (continued) 
[454] A Reset parameters: 
[C455] AANFP(9]©€AANFP[9]-AAN=0 
[456] AANFPE71¢AANFP[E7]+AAN-AAM 
[457] AARPS[AASETI]©¢AAN 
[458] A Last field index: 
C459] AAF¢€O 
[460] A Loop by active field: 
[461] AAL63:>(AAFZAANF) pAAL3 0 
[462] a Current field index and number: 
[463] AAF¢AAF+1 
[464] AAFLD¢AAAFLDSLAAF] 
[465] A Read, compress, replace field FLD for set SET: 
[466] AACMP€AATIE, AAFLD+AASDISP 
[467] (AABIT/OFREAD AACMP)JOFREPLACE AACMP 
[468] >AAL63 
[469] A 
[470] A Replace LV, FP, RPS, ARPS: 
[471] AAL64: AALV OFREPLACE AATIE,9 
C472] AANFP OFREPLACE AATIE,7 
C473] AARPS OFREPLACE AATIE,8 
[474] ~»>(€(xXAADEL)JO 
[475] AAARPS OFREPLACE AATIE,10 

4 


[WSID: MULTIFLO] 
V R¢A LAYERS B 
Used as: 


RINDS¢(CFP,KFLD)IOTA VALUES LAYERS 2 
RINDS¢€IOTARHO FP LAYERS Z 

RINDS¢SVEC SLASHIOTARHO FP,SFLDS LAYERS 2Z 
FP¢SVEC COMPRESS FP,SFLDS LAYERS Z 


A 

A 

A 

A 

re) 

A 
[7] A MAT¢SVEC SELECT FP,FLDS,0,SFLDS LAYERS Z 
[8] A SVEC SELECTWS FP,FLDS,0,SFLDS LAYERS Z 
C9] A CFP,XFLDS)EXECUTE XVEC LAYERS Z 
C10] A CFP,XFLDS,0,SFLDS)JEXECUTE XVEC FOR SVEC LAYERS Z 
C11] a 
[12] aA where Z is a vector (if FP[1] is a vector field) 
[13] A or a matrix (if FP[1] is a matrix field) of the 
[14] A larger values for those sets of records to be 
[15] aA considered in the operation being performed. The 
[16] A global variable <layers> is erased after performing 
[17] aA the operation (e.g. by IOTA or EXECUTE). 
[18] A 
[19] R¢A 
[20] layers¢B 

V 
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5. The APL*PLUS file utility functions listed above should be 
changed as follows to work on a SHARP APL system: 


A. Make the following direct replacements wherever they occur: 


APL* PLUS > SHARP APL 
OFREAD OREAD 
OFREPLACE OREPLACE 
OFAPPEND OAPPENDR 
OFNUMS ONUMS 

OTCNE DAVC156+0I0] 


B. Wherever ODERROR is used, replace the expression with an 
equivalent OSIGNAL expression. For example: 


OERROR (1#eeFP)/'RANK ERROR’ 
‘RANK ERROR’ OSIGNAL (Cl#opeFP)/599 


While not listed in this book, the SHARP APL versions of the file 
utility functions are available on disk. See the Postscript. 


Pus PN Be ee a A A Rs As Ae As ss A A ss A 


Unfortunately, the modifications required to make the file utility 
functions operational in APL2 are much more dramatic. Since files 
are limited to fixed length records rather than variable size 
components, we must take a different approach. 


One approach is to use a set of file access functions which emulate 
the behavior of the file access functions on APL*PLUS and SHARP APL 
systems. For example, the functions in the IBM public workspace 2 
VAPLFILE (CREATE, USE, SET, GET,...) perform such an emulation. When 
the file is created, you specify the record length, the number of 
records and the number of components. The utility functions break 
apart arrays which are too large for a single record and maintain 
directories to locate and reassemble the pieces of such arrays. If 
you take this approach, your most difficult tasks are choosing a 
record length and estimating the number of records the file will 
ultimately require. 


Another approach is to assume that a record length is chosen which is 
sufficiently large that any of the objects of the file may fit within 
it. Since the data components of a multi-set transposed file are 
limited to a specified block size (FP[51]), the maximum object size 
can be determined in advance. The maximum size of each data 
component is a function of its field type, its field width and the 
block size. Therefore, given the parameters of a file (FT and FP), 
you can determine a record length which is just long enough to 
accomodate any single file component. Further, if the maximum number 
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of records and sets are specified, you can compute the total number 
of records required. 


Given the record length and number of records, the file may be 
created with all of its records. These records may then be accessed 
directly (using BDAM) to read or replace objects. In fact, it isa 
simple matter to write functions FREAD and FREPLACE which behave like 
the APL*PLUS system functions OFREAD and OFREPLACE. 


Appending new records, however, is not so easy. Therefore, the file 
should be initialized with its maximum number of records, all of 
which are flagged deleted. The subsequent "catenation" of new 
records will actually be performed by replacing these deleted records. 


Let's pursue the latter approach. To implement it, you can do the 
following: 


A. Write a dyadic function NRECARECL whose left argument is FP, 
whose right argument is FT and whose result is the 2 element 
vector indicating the number of records and the record length 
required for the file. FP[6] must be the maximum number of 
sets needed, FP[7] the maximum number of records, and FP[11] 
the number of the deletion bit field as a negative number. For 


example: 
FT¢2 10012 111231231212000233224d1d1121 
FP¢5 987 10 50 2000 25 50000 700 77 
NL¢FP NRECARECL FT 
NL 
300 24020 


B. Write a dyadic function FCREATE whose left argument is the name 
of the file to be created, whose right argument is a 3 element 
integer vector: "tie number", number of records and record 
length. The file is created, and variables (e.g. CTL987, 
REC987) are shared with BDAM using the tie number to distinctly 
identify this file from other files. The records are appended 
Via QSAM or via the FMT option of BDAM. For example: 


‘'POLDATA.DATA’ FCREATE FP[2],NL 


C. Write functions FREAD and FREPLACE to emulate the APL*PLUS 
system functions OFREAD and OFREPLACE. Assume variables shared 
with BDAM using the tie number to distinctly identify the file 
from other files. 


D. Modify the INITFILE function (listed above for APL*PLUS 
systems) to require FP[11] to be negative and to fill the 
entire file (FP[6] sets) with "deleted" records. Use FREPLACE 
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in place of OFAPPEND since the file already contains all of its 
"components". For example: 


FP INITFILE FT 


E. Modify the APL*PLUS file utility functions (CATREC, CATRECWS, 
IOTA, IOTARHO,...) with the following direct replacements: 


APL* PLUS > APL2 
OFREAD FREAD 
OFREPLACE FREPLACE 
OTCNL OTCL1+O0TO) 
OERROR OES 


F. Modify CATREC, CATRECWS, INDEXA, INDEXWSA and EXECUTE to 
eliminate the "append new sets" logic since the file is 
initialized with the maximum needed number of sets. If all 


sets are full and more records are to be catenated, signal a 
FILE FULL error. 


G. Write functions FTIE, FUNTIE and FERASE to emulate the APL*PLUS 
system functions OFTIE, OFUNTIE and OFERASE. Share variables 


with BDAM using the tie number to distinctly identify one file 
from another. For example: 


FUNTIE FPE£2] 


While not listed in this book, the APL2 versions of the file utility 


functions (including NRECARECL, FCREATE, FREAD,...) are available on 
disk. See the Postscript. 
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BOOLEAN TECHNIQUES 


1. 

or 
2% 

or 
3. 
4. 


A/NVEC=[NVEC 

A/O=1LINVEC 

(Note: The first expression checks whether the elements 
are integers within comparison tolerance, i.e. OCT. The 
second expression checks whether the elements are exactly 


integers to full internal precision. Comparison tolerance 
is not meaningful for "0=".) 


COV\' ' FOCVEC)/CVEC 


C-+/A\' '=OCVECILCVEC 


(+/A\CMAT=' ')OCMAT 


MAP¢INPUTE ’ 0123456789" 
+/MAP>"“110,MAP 
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[WSID: TIMING] 
VY TIMEADEFINE AANAME;01I03;AAALP; AAAN; AABL; AAI 3 AAN; AANL; 

AANUM; AAPAN ; AAPBL3; AAPNUM3 AAT; AATCNL; AAVR 

C1] OLO¢21 

[2] aA Remove blanks from fn name: 

[3] AANAME€ CAANAME#!’ ')/AANAME 

C4] A APL*PLUS: 

[5] DOERROR(C340NC AANAME)/'NOT A DEFINED FUNCTION’ 

f6] ONERROR(V/C,’*' ,OSTIOSS '*' ,AANAME,'[')/' FUNCTION 
SUSPENDED ' 

{7 ] DERRORCO=eAAVR¢OVR AANAME)/' FUNCTION LOCKED’ 

C8] AATCNL¢UTCNL 

[9] A SHARP APL: 

[10] A ‘NOT A DEFINED FUNCTION’ OER (3740NC NAME) 0599 

[11] A Te2 OWS 2 A STATE INDICATOR 

[12] a 'FUNCTION SUSPENDED’ OER (v/(((1leeT),1+oNAME)TT)A. = 
NAME,'[’}9599 

[13] Aa 'FUNCTION LOCKED’! OER (CO=oVR¢e1 OFD NAME)0599 

[14] A TCNL¢DAVL157] 

[15] a Flag newline chars (Boolean partition vector): 

C16] AANL€”~ 10AAVR=AATCNL 

C17] a Flag digits: 

C18] AANUM¢CAAVRE '0123456789' 

[19] aA Pole vector of contiguous digits: 

C20) AAPNUM¢AANUM# “10AANUM 

[21] a Flag char following ] after line no.: 

[22] AANL€-1OAAPNUM\ ~“10AAPNUM/ “1OAANL 

[23] AAPNUM¢O 

[24] A Flag blanks: 

C25] AABL€EAAVR=' ' 

[26] aA Pole vector of contiguous blanks: 

[27] AAPBLEAABL# “19AABL 

[28] a Flag 1st nonblank char in each line: 

[29] AANL€CAANL>AABL) VAAPBL\ “19AAPBL/AANL 

[30] AABL¢AAPBL€O 

(311 A Flag letters: 

C32] AAALPCAAVRE 'ABCDEFGHIJKLMNOPORSTUVWXYZAabcdefghijklmno 
pqrstuvwxyza' 

[33] a Flag digits or letters: 

C34] AAAN¢AAALPVAANUM 

[35] AANUM¢O 

[36] A Pole vector of contiguous digits/letters: 

[37] AAPANCAAAN# ~10AAAN 

[38] AAAN€O 

[39] A Pole vector of identifiers: 

[40] AAPANCAAPAN\AATYV 1OAATEAAPAN/ AAALP 

[41] AAALP¢0O 

[42] A Pole vec of identifiers at start of line: 

[43] AAPAN¢AAPAN\AATV” 1OAATCAAPAN/ AANL 

[44] Aa Pole vector of labels: 

C45] AAPANCAAPAN\AATVI1OAATC! 2’ =AAPAN/AAVR 

[46] A Flag 1st nonblank after label or at start: 

[C47] AANL¢CAANL>AAPAN) V “1OAAPAN\ ~10AAPAN/AANL 
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[48] 
C49] 
[50] 
[51] 
C52] 
[53] 
[54] 
[55] 
[56] 
[57] 
[58] 
[59] 
[60] 
C61] 
[62] 


C1] 
[2] 
[3] 
C4] 
[5] 
C6] 
[7] 
C8] 
[9] 
C10] 
[11] 
[12] 
[13] 
[14] 
[15] 
C16] 
C17] 
C18] 
[19] 
[20] 
[21] 
[22] 
[23] 
[24] 
[25] 
[26] 
C27] 


V TIMEADEFINE (continued) 
AAPAN¢O 
A Expand VR for 2 more chars on each line (A0): 
AAVR€(C1+AANL+AANL) /AAVR 
OLO<0 
AAIT©LAAN€+/ AANL 
A Insert the 2 chars: 
AAVREI ( CAANL/LOPAANL) +AAI+AAT)°.+ O LI€CAAN,2)0'AO' 
AAI€¢0O DEF AAVR A APL*PLUS 
A Add a last fn line too: 
AAI©ODEFL AANAME,'C',C@AAN+1),' ] 
A I¢3 OFD (CC 61VR),'0C',C61++/NDL),' 
A Initialize stopwatch var: 
AM¢(AAN,4)0 0 0 ,(L/10),0 
Aa Last line run: 
AL€O 
V 


A' A APL*PLUS 
JAV' A SHARP APL 


[WSID: FNIDS] 

V IDENTIFY VR;3;ALP;AN;ARGS ;AVAR;BL;COLS;HDR; IDLEN;IDS;3 
IDSTART; IND; LAB; LEN; LOC;N;NC;NCCON;SNCMT;5NID;3SNL3;NQ;NUM; 
PAN ; PARSE; PBL; PID; PIDA; PLAB; PNUM; PSID;R;RESULT;RVAR;S; 
START ;T;TCNL;0I0 

A Evaluates the vector representation (VR) of a 
A function and displays all possible inconsistencies 
A involving the use of identifiers. 
A Requires subfunction: CMIOTA. 
OLTO<0 
A Construct a scalar newline character: 
TCNL¢OTCNL 
A TCNL¢UTCLE1] Aa APL2 
A TCNL¢OAVEC156] A SHARP APL 
A Where does header end? 


TEVRULUTCNL 

A Grab header of fn (less newline): 
HDR¢TpVR 

Aa Drop header from vis rep: 
VR¢ETIVVR 

A Where does fn syntax end? 
T©HDRv! 3’ 

A Localized vars: 

LOC¢TLJHDR 

A Drop local vars: 
HDR¢ToHDR 


A Drop leading junk: 
HDR¢CV\~HDRE’ V')/HDR 

A Is there an explicit result? 
T<CoHDR)J>IND«HDRu'<’ 

A Explicit result (if any) 
RESULT¢ (TXIND) oHDR 
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V IDENTIFY (continued) 
[28] A Remove result from header: 
[29] HDRe’ ',CTXIND+1)/HDR 
[30] a Starting indices of fn name, args: 
[31] START¢1+(CHDR=' ')/tLeHDR 
[32] A Lengths of names: 
[33] LEN€C1ilSTART,1+eHDR)-1+START 
[34] A No. of args: 
[35] Te -1+oLEN 
[36] aA Indices of args (if any): 
[37] NepINDe( (T=2)00),(xTIET 
[38] A Don’t consider fn name: 
[39] START<¢STARTCIND] 
C40] LEN¢LENCIND] 
[41] A Length of longest arg: 
[42] CoLs¢f/0,LEN 
[43] A Raveled, blank mat of args: 
[44] ARGS€(NxCOLS)o’ ' 
[45] a Fill them in (T¢MONIOTA LEN): 
[46] T¢T+LpTeLEN/-~110,+\LEN 
C47] ARGS([T+LEN/COLSXLNIJ¢HDR[IT+LEN/START] 
[48] A Reshape to mat of args: 
[49] ARGS¢(N,COLS ) eARGS 
[50] aA Flag newline chars (Boolean partition vector): 
[51] NL€ 1¢9VR=TCNL 
[52] Aa Flag nonquotes: 
C53] NQ¢VR#!'!'!’' 
[54] A Map of chars not in quote pairs (i.e. char constants) 
[55] a within each fn line (NCCON¢NL pEQSCAN NQ): 
[56] NCCON¢€=\NQOANLAT#  110,TE~NL/=\"111,NQ 
[57] NQ<¢0 
[58] a Flag non-aA chars (includes as in quotes): 
[59] NCe€NCCONAVR='A' 
[60] a Map of chars which do not follow a a Cignoring as 
[61] A within quotes) within each fn line. as are flagged 0. 
C62] A CNCMT¢NL pANDSCAN NC): 
[63] S€NL2NC 
[64] NCMT¢~4\S\T#F°110,T¢€~S/NC 
[C65] S¢T¢NC¢O 
[66] A Map of chars which are not included within as or '’: 
[67] PARSE¢NCMTANCCON 
[68] VR¢PARSE/VR 
[69] NL¢PARSE/NL 
[70] PARSE€NCCON¢|+NCMT€0O 
[71] aA Flag digits, letters, blanks: 
[72] NUM¢VRE!0123456789' 
C73] ALP¢VRe’ABCDEFGHIJKLMNOPQRSTUVWXYZAabcdefghijklmnopars 
tuvwxyZA’ 
[74] BL€VR=' ' 
[75] aA Flag alphanumeric chars: 
C76] AN€¢NUMVALP 
[77] A Pole vec of contiguous digits: 
C78] PNUM¢NUM#~110,NUM 
C79] NUM¢O 
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V IDENTIFY Ccontinued) 
[80] A Pole vec of contiguous digits/letters: 
C81] PAN¢AN#°110,AN 
[82] AN+¢<O 
[83] A Pole vec of identifiers: 
[84] PID¢PAN\TV 190T¢PAN/ALP 
C85] ALP<«PAN¢0O 
[86] a Flag '0D’ before identifiers (Onames): 
[87] T¢1OPID 
C88] TeT\'O'=T/VR 
[89] Aa Shift leading poles of Onames to include D: 
[90] PID¢TVPID> 160T 
C91] T¢0O 
[92] A Flag char following ] after line no.: 
[93] START¢€ 1®PNUM\ -10PNUM/ “10NL 
[94] NL€¢PNUM¢O 
[95] A Pole vec of contiguous blanks: 
C96] PBL€BL4°~110,BL 
[97] A Flag 1st nonblank char in each line: 
[98] START¢€CSTART>BL)VPBL\ 19PBL/START 
[99] BL¢PBL¢O 
[100] aA Pole vec of identifiers at start of line: 
[101] PSID¢PID\TVY 10T¢PID/START 
[102] START<€0O 
C1031 A Pole vec of labels: 
[104] PLAB¢PSID\TV1OT¢’:'=PSTD/VR 
C105] PSID<O 
C106] A Pole vec of direct assignment identifiers: 
[107] PIDA¢PID\Tv10T¢<’<'=PID/VR 
C108] A Start and end (+1) indices of identifiers: 
[109] IND¢PID/tePID 
[110] A No. of identifiers: 
[111] NID¢(pIND)+2 
C112] IND¢C(NID,2)eIND 
[113] a Start indices of identifiers: 
[114] IDSTART<IND[ ;0] 
C115] A Lengths of identifiers: 
C116] IDLEN¢IND[E :11]-IDSTART 
[117] A Starting indices of local vars: 
C118] START¢1+(CLOC='3')/teLOC 
[1191] A Lengths of local vars: 
C120] LEN¢C1lSTART,1+oLOC)-1+START 
[121] A Length of longest ident.: 
[122] CoLs¢(Cf/LEN)S Cf /IDLEN) IT (eRESULT)S1leARGS 
C123] A Pad arg names to conform: 
C124] ARGS¢CCloeeARGS) ,COLS) TARGS 
[125] A 0 row mat if no result: 
C126] RESULT¢( (xpRESULT) , COLS) epeCOLSTRESULT 
[127] A Raveled blank mat of local vars: 
[128] T«CCOLSxXN¢pSTART)e’ ' 
C129) A Fill them in (S€MONIOTA LEN): 
[130] S¢S+rieS¢LEN/-~110,+\LEN 
[131] Tl(S+LEN/COLSxXitN]©LOC[ES+LEN/START] 


-~453- 


Chapter 15 Solutions 


V 
[132] 
[133] 
[134] 
[135] 
[136] 
C137] 
C138] 
[139] 
[140] 
[141] 
[142] 
[143] 
[144] 
[145] 
[146] 
C147] 
[148] 
[149] 
[150] 
[151] 
[152] 
C153] 
[154] 
[155] 
C156] 
C157] 
[158] 
[159] 
C160] 
[161] 
C162] 
[163] 
C164] 
[165] 
[166] 
C167] 
C168] 
C169] 


(170] 
[171i] 
L172] 
C173] 
C174] 
C175] 
{176] 
C177] 
C178] 
Li79 J 
C180] 
C181] 
C182] 


BOOLEAN 


IDENTIFY (continued) 

A Reshape to mat of local vars: 
LOC<(N,COLS ) oT 

A Raveled blank mat of identifiers: 
T¢CCOLSXN¢oIDSTARTIe’ ' 

Aa Fill them in (S¢MONIOTA IDLEN): 
S¢S+tleS€IDLEN/-~110,+\IDLEN 
TCS+IDLEN/COLSXLN]©<VRIS+IDLEN/IDSTART] 

a Reshape to mat of identifiers: 
IDS€(N,COLS) eT 

A Mat of label names: 
LABe(S¢C(N,2)oPID/PLAB)£3;01])4IDS 
PLAB€¢0O 

A Mat of assigned vars: 

AVAR¢ (T¢€C(N,2)oPID/PIDA)£301)/4IDS 
PID¢PIDA¢0 

A Mat of referenced vars: 

RVAR¢ (S¥T)4IDS 

A Make distinct: 

AVAR¢(CCAVAR CMIOTA AVAR)=tleepAVAR) ZAVAR 
RVAR¢CCRVAR CMIOTA RVAR)=tleeRVAR) #RVAR 

A 

A Flag distinct labels: 
+(A/TE(LAB CMIOTA LAB)=ileepLAB)oeL1 


Ne’ '#S¢,' ',C+T)ALAB 
LAB¢TALAB 
O<+'’REDUNDANT LABEL: ',1/CNV10N)/S 


A Flag distinct locals: 


L1: Re’! 

+(A/T€(LOC CMIOTA LOC)=lilppLOC) eL2 
Re,’ ',(€~*T)FLOC 

LOC¢€T#LOC 
L2:7%C€v/T¢CloopRVAR)J=RVAR CMIOTA LOC)IJL3 
Ne' '#S85¢,' ',TALOC 

O¢’ IDENTIFIER LOCALIZED BUT NOT USED: 
A Any unassigned local vars? 


L3:9CV/T€CLleepAVAR)<AVAR CMIOTA LOC)1L4 

Ne’ '#S5¢,' ',TALOC 

O«’' IDENTIFIER LOCALIZED BUT NOT ASSIGNED: 
/S 

A Include result and args in local vars: 
L4: LOC¢ (RESULT, COJARGS) ,£01]L0C 

A Flag distinct locals: 

2CA/T€CLOC CMIOTA LOC)J=ilee LOC) eL5 

R¢eR,,' ',€*T)4LOC 

LOC¢T#LOC 
L5:>(xoR)LL6 

Ne’ '#R 

O<«’REDUNDANT LOCAL VARIABLE: 
A Any localized labels? 
L6:>(Cv/T€CleepLOC)>LOC CMIOTA LAB)JL7 
Ne” "#5¢,' *,'T/LAB 
O<+'’ LOCALIZED LABEL: 


',1LCNVION)I/R 


',1LCNV1ON)/S 
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IDENTIFY Ccontinued) 
A Any unreferenced labels? 
L7:7>C€v/T¢€CloeeRVAR)=RVAR CMIOTA LAB)JL8 

Ne’ '#S5¢,' °',TALAB 

O<’UNUSED LABEL: ',1/0NV10N)/S 
a Any assigned labels? 
L8:%C€V/T¢CLloepAVAR)>AVAR CMIOTA LAB)JL9 

Ne’ '#S¢,'’ ',TALAB 

D<’ASSTGNED LABEL: ',1/°0NV10N)/S 

A Any unlocalized assigned vars? 
L9:7>(vV/T¢CleeLOC)=sLOC CMIOTA AVAR)1IL10 

Ne’ '#S¢,' ’',TAAVAR 

O<' IDENTIFIER ASSIGNED BUT NOT LOCALIZED: 
/S 

A Any unlocalized referenced vars? 
L10:9(V/Te( (lee LOC)+lepLAB)<(LOC,(01LAB)CMIOTA RVAR)J! 
Lit 

Ne’ '#S5¢,' ',TARVAR 

O<«'TDENTIFIER USED AND NOT LOCALIZED: ',1!/0NV1I0N)/S 
L11:2(V/(1leeAVAR)<AVAR CMIOTA RESULT)1L12 

O<’RESULT NOT ASSIGNED: ',,RESULT 
L12:7CV/T¢CloepRVAR)=RVAR CMIOTA ARGS)JO 

Ne’ '#S5¢,' ',TAARGS 
O<’ARGUMENT NOT USED: 


',1LCNV10N) 


',ALCNVIONI/S 


CWSID: FNIDS] 
R¢KEEP LOCALIZE VR;ALP;AN;ARGS;COLS;HDR; IDLEN;1IDS; 
IDSTART; IND; KIDS ; KLEN ; KSTART ; LLEN ; LOCAL;N;NB;NC;NCCON; 
NCMT ;NID;NKEEP;NL;NQ;NUM; PAN; PARSE; PID; PIDA;RESULT;S;T 
;TCNL;OL0 


[1] a Modifies the vector representation (VR) of a 

[2] a function such that its header contains as localized 
[3] A variables all and only those variables which are 
[4] A assigned within the function. KEEP is a character 
[5] a matrix or vector (blank delimited) of names of 

[6] A variables to not localize though assigned, or to 
[71 aA localize though not assigned. 

[8] aA Requires subfunction: CMIOTA. 

[9] OLO<0O 

[10] A Construct scalar newline character: 

C11] TCNL¢OUTCNL 

C12] A TCNL¢€OTCL1] A APL2 

C13] A TCNL€UAVE156] A SHARP APL 

[14] A Flag newline chars (Boolean partition vector): 

C15] NLe 1O0VR=TCNL 

[16] a Flag nonquotes: 

C17] NQ¢VR#'''!' 

[18] aA Map of chars not in quote pairs (i.e. char constants) 
[191 aA within each fn line (NCCON¢NL pEQSCAN NQ): 

[20] NCCON¢=\NOANL\T# 110, T€E*NL/=\7111,N0 

[21] NQ<«O 
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V LOCALIZE (continued) 
[22] a Flag non-a chars Cincludes as in quotes): 
[23] NC¢NCCONAVR=' A’ 
[24] A Map of chars which do not follow aa (ignoring as 
[251 aA within quotes) within each fn line. as are flagged 0. 
[26] A CNCMT¢NL pANDSCAN NC): 
C27] S€NL2ZNC 
[28] NCMT¢~4\S\T#F 110,T¢€~S/NC 
C29] S€¢TeNC¢eO 
[30] A Map of chars which are not included within as or '': 
[31] PARSE¢NCMTANCCON 
[32] NCCON¢NCMT<0 
[33] aA Flag digits, letters: 
[34] NUM¢PARSEAVRE !’0123456789' 
[35] ALP¢PARSEAVRE ’' ABCDEFGHIJ KLMNOPORSTUVWXYZAabcdefghijklm 
nopqrstuvwxyZa’ 
[36]  PARSE¢O 
[37] A Flag alphanumeric chars: 
[38] AN¢NUMVALP 
[39] NUM¢0O 
[40] A Pole vec of contiguous digits/letters: 
[41] PAN¢AN#° 110,AN 
C421 AN¢€O 
[43] a Pole vec of identifiers: 
[44] PID¢PAN\TY “10T<«PAN/ALP 
[45] ALP¢PAN¢O 
[46] a Flag '0’ before identifiers (Unames): 
[47] T¢10PID 
[48] T¢T\'O'=T/VR 
[49] aA Shift leading poles of Unames to include 0: 
[50] PID¢TVPID>"14T 
[51] T¢o 
[52] aA Pole vec of direct assignment identifiers: 
[53] PIDA¢PID\Tv1¢T¢’<¢'’=PID/VR 
[54] a Start and end (+1) indices of identifiers: 
[55] IND¢PID/tePID 
[56] A No. of identifiers: 
[57] NID¢CeIND)+2 
[58] IND¢(NID,2)eIND 
[59] aA Start indices of identifiers: 
[60] IDSTART¢INDLE 50] 
[61] a Lengths of identifiers: 
C62] IDLEN¢IND[ 3;1]-IDSTART 
[63] A Map vec of nonblanks in exception identifiers: 
[64] NBe’ '#KEEP¢€,’ ', KEEP 
[65] a Start indices in KEEP of identifiers: 
[66] NKEEP<¢oKSTART¢€ (NB>~190NB)/LeNB 
[67] A Lengths of KEEP identifiers: 
C68] KLEN¢C1+CNB>10NB)/tLoNB)-KSTART 
C69] A Length of longest identifier: 
[70] COoOLS¢Cf/KLEN) [lf /IDLEN 
C71] a Raveled blank matrix of identifier names: 
[72] IDS€(NIDXxCOLS)o’ * 
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V LOCALIZE C(Ccontinued) 
C733] a Fill them in (T¢MONIOTA IDLEN): 
[74] T¢eT+r.eT¢IDLEN/-~110,+\IDLEN 
[75] IDS(T+IDLEN/COLSxXtLNIDI€VRIT+IDLEN/IDSTART ] 
[761 A Reshape to mat of identifiers: 
{77] IDS¢CNID,COLS)eIDS 
[78] A Prepare to look at header syntax: 
[79] T€¢VRLTCNL,' 3’ 
[80] A Take up to local vars: 
[81] HDR¢C(L/T)eVR 
[82] A Drop leading junk: 
[83] HDReCv\~HDRE’ V')/HDR 
[84] T¢CoHDR)>IND¢HDRu’ ¢’ 
[85] A Explicit result (if any): 
[86] RESULT¢<C(CT,COLS ) op COLSTINDeHDR 
[871 A Remove result from header: 
[88] HDR¢CTXIND+1)lHDR 
C89] A No. of arguments: 
[90] S¢+/' '=HDR 
[91] A Arguments (if any): 
[92] ARGS+IDS[T+((S=2)00),(xS)0S:] 
[93] Aa Mat of assigned identifiers: 
[94] LOCAL€CCCNID,2)o0PID/PIDA)£;3;01])4IDS 
[95] PID«PIDA<O 
[961 A Remove redundancies: 
C97] LOCAL€CCLOCAL CMIOTA LOCAL)=tlee LOCAL) #4LOCAL 
[98] A Raveled blank matrix of exception identifiers: 
[99] KIDS€¢C(NKEEPXCOLS)o0' ' 
[100] a Fill in (T€MONIOTA KLEN): 
C101] TeT+ieT¢KLEN/-~110,+\KLEN 
[102] KIDSLT+KLEN/COLSxUNKEEPJ©KEEPCET+KLEN/KSTART ] 
[103] A Reshape to mat of exception identifiers: 
[104] KIDS¢CNKEEP,COLS)eKIDS 
[105] A No. of syntax local vars: 
[106] N¢leeT¢RESULT, [LOJARGS 
[107] A Squeeze out identifiers in KEEP (globally assigned) 
[108] A or syntax local vars: 
[109] LOCAL€CS€¢(CN+NKEEP)=IND€CKIDS ,COITICMIOTA LOCAL) #4LOCAL 
[110] A Include identifiers in KEEP which are not assigned 
[111] A Cassigned within subfns): 
[112] T€C1+N+NKEEP)o1 
[113] MTLINDIJ€0O 
C114] LOCAL¢LOCAL, [0] (NKEEPeT)/KIDS 
[115] a Place local vars in sorted order: 
C116] LOCAL¢€LOCALCOUAVALOCAL; J 
C1171 A Lengths of local vars: 
[118] LLEN€¢+/LOCAL#’ ' 
[119] A Indices after each header local semicolon: 
C120) T¢+\1+0,LLEN 
[121] a Initialize local var segment of header: 
[122] HDRe(-1+-1TT)e';' 
[123] A Insert (S¢€MONIOTA LLEN): 
[124] S¢S+rieS¢LLEN/-~110,+\LLEN 
C125] HDRIS+LLEN/ “1lT]¢C, LOCAL) [S+LLEN/COLSx1t eLLEN] 
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VY LOCALIZE Ccontinued) 
[126] T¢VRUTCNL,’ 3! 
[127] R€CCL/TIeEVR) ,HDR,TLOJLIVR 
V 


CWSID: COMMENTS ] 
VY R¢UNCOMMENT VR;BL3;NC3;NCCON s;NCMT;NL3;NQ;NUM; PARSE; PBL; 

PNUM;S;START;T 
[1] aA Modifies the vector representation (VR) of a 
[2] a function such that its comments are removed (but 
C3] aA the lamp symbols remain for full-line comments). 
[41 aA Flag newline chars (Boolean partition vector): 
C5] NL¢ 109VR=-OTCNL 
C6] aA NL©€ 1O0VR=OTCL1+0I0] A APL2 
C7] A NL¢-10VR=OAVE156+01I0] aA SHARP APL 
[8] aA Flag nonquotes: 
[9] NQ¢VR#!''’' 
[10] A Map of chars not in quote pairs (i.e. char constants) 
[11] A within each fn line (NCCON¢+NL pEQSCAN NQ): 
[12] NCCON¢=\NQOANL\T#Z 110, T€¢~NL/=\7111,NQ 
[13] NQ¢0 
[14] A Flag non-A chars (Cincludes as in quotes): 
[15] NC¢+NCCONAVR='A’ 
[16] Aa Map of chars which do not follow a a (ignoring as 
C171 A within quotes) within each fn line. as are flagged 0. 
C18] aA CNCMT¢+NL pANDSCAN NC): 
[19] S¢NL2NC 
[C20] NCMT¢~4\S\T#47110,T¢~S/NC 
[21] S¢T¢Nc¢O 
[221 A Always include newline chars (extend maps left by 1): 
[23] NCMT¢NCMTV1ONCMT 
[24] A Map of chars which are not included within as or ’'’: 
[25] PARSE¢NCMTANCCON 
[26] NCCON¢€0O 
[27] A Flag digits, blanks: 
[28] NUM¢PARSEAVRE’ 0123456789! 
[29] BL¢PARSEAVR=' ' 
[30] PARSE¢0O 
[31] A Pole vec of contiguous digits: 
[32] PNUM¢NUM# -110,NUM 
[33] NUM¢0O 
[34] aA Flag char following ] after line no.: 
[35] START¢”~1O0PNUM\ ~1¢0PNUM/ ~10NL 
[36] NL€PNUM¢O 
[37] A Pole vec of contiguous blanks: 
C38] PBL€BL4-110,BL 
C39] A Flag ist nonblank char in each line: 
[40] START¢(START>BL)VPBL\ 10PBL/START 
[41] BL€PBL€O 
[42] aA Squeeze out cmnts except cmnt symb for full line: 
C43] R¢CNCMTVSTART)/VR 

V 
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IRREGULAR ARRAYS 


or 


AVGee(+/"SALES)+o° ‘SALES (APL2) 
AVG+, T(+/" SALES )+o0 “SALES (APL*PLUS) 
Te ,0S>SALES (SHARP APL) 


AVG¢€(+/TTS>SALES )+T 


AVG¢ (SALES NNSUMCOLS 1 1)+NNLEN SALES (Conventional APL) 


UNAMES¢ ( C(NAMESUNAMES ) = top NAMES ) /NAMES C(APL2, APL*PLUS, 


SHARP APL) 


UNAMES¢NAMES CNIDX CCNAMES CNIOTA NAMES )= 


LITNAMES )/L1ITNAMES (Conventional APL) 


SORTED¢NAMES CNIDX UAV CNGRADEUP NAMES (Conventional APL) 
UNAMES¢SORTED CNIDX (€°111,~SORTED CNEQ 


DDDODIDIIOIO OO 


SORTED CNIDX 10lL1iTSORTED)/L1TSORTED 


C[WSID: CNFNS] 


V RECNEST MsASB;sLsNsSsE;1r;010 


CNEST is used to convert a character vector or 
matrix into a character nest. If a character 
vector, the first character is taken as the 
delimiter; each set of characters between 
delimiters defines one segment. If a character 
matrix, each row (less the trailing blanks) 
defines one segment. The rows of a character 
matrix are assumed to be left justified, i.e. 
to have no blanks to the left of the leftmost 
nonblank character. 

Branch if scalar or vector: 


>2(12=0eM) 0 L2 
OLO<0 
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VY CNEST (continued) 
[14] A Insure even no. cols: 
[15] 2»C0O=211LleM)eL1 
[16] MeM,’ ' 
[17] A Lengths: 
C18] Li: Le+/v\' '4o0M 
[19] a Cvt to int mat, incl lengths: 
[20] M¢L,163 ODR M 
[21] A No. elts to take from each row, incl length: 
[22] NepLe1+lL+2 
C23] a Start elts of raveled mat: 
[24] S€¢C1lloM)x.N 
[25] A R€EMONIOTA L: 
[26] R€R+teR¢€L/-7~110,+\L 
[27] R€EC(N,C€+\7110,L)+1+N),0,M)CR+t+L/S] 
C28] 20 
[29] a Word starts,lengths: 
[30] L2:010¢1 
[31] BeM=1TM 
[32] N¢oeS¢B/teB 
C33] L¢e°-1+(1lS,1+eB)-S 
[34] Tet+\ 1TLC1+N) , E€1+Bef L=2 
[35] A 8224=163 ODR ’ ns 
[36] OD1IO¢O 
[37] R¢(N,I),E/8224 
[38] RLLI¢L 
C39] R¢82 ODR R 
[40] QO1o0¢l1 
[41] A A¢MONIOTA L: 
[C42] A¢At+r.eAtL/-"1L10,+\L 
[43] RCA+L/2+I+LJ¢MCA+L/S] 
[44] R¢163 ODR R 
V 


CWSID: CNFNS] 
V REeW CNAM M3sJ3L3;N;S3;X;U010 
CNAM is used to convert a character nest (M) toa 
character matrix (R) with W columns and one row 
per segment. If W is 10, the matrix will have as 
many columns as the longest segment in M. If W 
has two elements, the first is the number of columns 
of the result and the second is the justification 
indicator: 1 (left justify each segment within its 
row); 2 (right justify each segment within its row); 
3 (center each segment within its row). If the 


= 
~] 
had 
DDDBDDDAMAIIDID VID DIVO D DO 


[10] second element is omitted, a value of 1 (left 
C11] justify) is assumed. If the first element is 
C12] negative (e.g. ~1), the matrix will have as many 
C13] columns as the longest segment in M. 

C14] J€1l 2 3 (L,R,C): 


[15] Je1tc1ilw),l 
[16] A Handle char vec or scalar arg: 
[17] 3(82#0DR M)oL1 
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[18] 
[19] 
C20] 
C21] 
C22] 
C23] 
[24] 
[25] 
[26] 
C27] 
C28] 
[29] 
[30] 
[31] 
C32] 
C33] 
[34] 
[35] 
[36] 
C37] 


V CNAM (continued) 


L¢eoeMe ,M 

Wel1lTtTCCCO=o ,WIVO>1TW)/L) .W 

R¢EwtM 

R¢C1,WieCCJ#1)x0OFL CW-L)+1°3-1)0R 
»>0 

A No., starts, 
L1:0I0¢1 
X¢LNeM[OIOI 
OLo0¢+0 
L¢EM(IS¢M[X] ] 
9(O>1TW, 1)IIL2 
WeOl T/L 

A Truncate too-long segments: 
L2:L¢eLLweltw 

A Blank, raveled result: 
R¢E(NxW)o' ' 

S¢S+1 

A X¢MONIOTA L: 
X¢X+LeX€L/-~110,4+\L 

M¢(82 ODR MJCXK+L/S+S] 
9(L3,L4,L5)(J+°1] 

A Left: 

L3:RCX+L/WXLNJ]€¢M 

»L6 

A Right: 
L4:REX+L/ CWXLN) +W-LI€M 

»>L6 

A Center: 

L5:RIX+L/ CWXLN)J+L0.5xW-LI€M 
L6:R¢E(N,W)JoR 


lengths: 
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V R€¢C CNAV CNESTSN;3;L;3S;T;E;010 


A CNAV is used to convert a character nest 
A CNEST to a character vector with the 

a character string C between each segment. 
>(82#0DR CNEST)oeL1 

R¢CNEST 

>0 

A No., starts, 
L1:0T0¢1 
T¢LNCCNESTCOUIO] 

OLTO0¢<0 

L¢CNESTCS¢CNESTEITI] 

S¢S+1 

R¢oeCe,c 

E¢(Of~1+N+N)Jo0 1 0 

A Lengths of segments (incl separators): 
L¢€R+E\L-R 

A Starts of segments (Cin C, CNEST): 
S¢E\R+S+S 


lengths: 
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V CNAV (continued) 

C19] A R¢EMONIOTA L: 

[20] R¢ER+LeREL/-"110,+\L 

[21] Re(cC,82 ODR CNEST)JCR+L/S] 
V 


fWSID: CNFNS] 
V R¢A CNCAT B:D;3sE3;F3G;L;010 


[1] a CNCAT is used to catenate two character nests and 
[2] a return the catenated character nest result. Either 
[3] A argument may be a character nest or a character 
[4] a vector (i.e. a character nest "scalar"). 

C5] OLO<oO 

fF6] A VoV,V°9S,S°V,S°S: 

C7] 9(L1,L2,L3,L4)02182=C(ODR A),ODR Bi 

C8] L1:O1T0¢1 

[9] D¢eLECALOIO) 

C10] F¢ev.G¢eBlOIoO] 

[11] O10¢«0 

C12] D€ACD] 

[13] FeBC(F] 

C14] R¢eCCGtE,D),F+ 1+9A),CC1+E)IA),C1+G)1B 

[15] 30 

[16] L2:LeoBe,B 

[17] FefL+2 

[18] O150¢1 

[19] D¢ev.E¢ALOIO] 

C20] O1o0¢o 

C21] D€ACD] 

C22] R¢€C1+E,D,eA),CC1+E)JA),L,163 ODRCF+F)TB 

[23] 20 

C24] L3:LeoA¢,A 

C25] FefLée2 

[26] OI0¢l1 

(27] Dev.E€BL[OIO] 

C28] O10¢O 

[29] D¢€¢BI[D] 

C30) ReCCE+ 1 2),(0D+2+F),L,163 ODRCF+F) TA) ,C1+E)JB 
C31] 30 

[32] L4:DeoAe,A 

[33] E¢fD+2 

[34] FeoBe,B 

C35] GefF+2 

[36] R¢e(2 3 ,E+4),(D,163 ODRCE+E)TA),F,163 ODR(CG+G)TB 
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CWSID: CNFNS] 
VY R¢ECNEST CNIDX INDS;L;S;N 
CNIDX is used to extract one or more segments 
from a character nest (CNEST). INDS is an 
integer vector or scalar of the indices of the 
segments to be extracted. If a vector, the 
array extracted (R) will be a character nest. 
If a scalar, the array will be a character vector. 
{7 ] L¢€CNESTCOIO+S¢CNESTC1+INDS 1] 
[8] aA Branch if nest result: 
[9] »>CO#eNCoINDS)eL1 
C10] R¢Le82 ODR CNEST[(S+1)+t Li2] 
C11] 20 
C12] Li: Le1i+fL+2 
[13] A R€EMONIOTA L: 
[14] ReR+vepR€L/-7110,4+\L 
[15] Re (N,+\°1101+N) ,L),CNEST(R+L/S] 


| omen | 
A 
| oe 
DD DDD DD D 


C[WSID: CNFNS] 
V R¢A ASSIGN B 
C1] R¢A © assign¢B 
V 


CWSID: CNFNS] 
V R¢CNEST CNIDXA INDS;L;3S;NL3sN3;A;3B;C3NS;13U 
CNIDXA is used to replace one or more segments 
into a character nest (CNEST). INDS is an 
integer vector or scalar of the indices of the 
segments to be replaced. If a vector, the 
array replaced (assign) will be a character nest. 
If a scalar, the array will be a character 
vector. The function ASSIGN simply assigns its 
right argument to the global variable <assign> 
and returns its left argument. After being 
replaced, <assign> is erased. 
[11] N¢CNESTCOIO] 
[12] A Branch unless a scalar index: 
[13] »(€xpeINDS)eL2 
[14] LeCNEST(COIO+S¢CNEST(C1+INDSII 
[15] NL¢passign¢e,assign 
[16] a Index assign if same length: 
[17] A¢fLe2 
C18] BefNL+2 
[19] »C(A#¥B)eL1 
[20] R€CNEST 
[21] R(€(S+11+BI¢NL,163 ODRCB+B)Tassign 
[22] -?L9 
C23] L1i:R¢€S+1)oeCNEST 
[24] I¢C1+INDS+~0I0)+tN-INDS+~01I0 
C25] RCIJ€REIIJ+B-A 
C26] RLS+tOIOI]JCNL 


re 

1) 

Lat 
D>DDIDIIDOIO 
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VY CNIDXA Ccontinued) 


R¢R,(163 ODR(B+B) tassign) ,(S+A+1)1CNEST 
»>L9 

Aa Branch if nothing to assign: 
L2:%(0=U¢pINDS) oL7 

I¢1+ tN 

L¢CNESTCOIO+S¢CNEST(I]] 

A¢l L+2 

A Branch if char vec to assign: 
>(C¢82=0ODR assign)eL3 

R¢iU 
NL¢assign[O010+NS¢assign[1+R]] 
Bel NL+2 

>1L4 

L3:R¢ewul 

NL¢oassign¢,assign 

NS¢€0O 

Bel NL=2 

assign¢NL,163 ODR(B+B)tassign 
A Index assign if same length. 
A Branch if any lengths change: 
L4:%9CACINDS]V.#B)eL8 

A Drop header data off nest if multi-segments: 
>CeL5 

assign¢e(U+1)lassign 

>L6 

L5:BeUpB 

A Repeat data if scalar assignment: 
assign¢(Uxpassign) passign 
L6:R¢CNEST 

I¢SLINDS ] 

L¢1+B 

A S¢MONIOTA L: 
S¢S+leS€L/-7110,4+\L 
R(S+L/I]l¢assign 

>L9 

L7:R¢CNEST 

»L9 
L8:ICINDSJ¢N+R 

L¢1+(A,BIJCTI 
S¢(S,NS+oCNEST) [TI] 

A R¢MONIOTA L: 

R¢ER+ LoReL/-7110,+\L 
R¢(N,+\71101+N),L), CCNEST,assign)[R+L/S] 
L9:QERASE ‘assign’ 
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CWSID: CNFNS] 

VY L¢CNLEN CNEST;O01I0 
C1] a CNLEN is used to return the lengths of the 
[2] a segments of the character nest right argument. 
C3] OLI0¢1 
[4] L¢tCNEST[1] 
[5] DIO<0 
C6] L¢CNESTECCNEST(CL]] 


[WSID: CNFNS] 
VY R¢CSEQ CNGRADEUP CNEST:SA;3;B;sD;3sGsIsL:;MsN;PsR3sS;sT32Z 
[1] aA CNGRADEUP is used to determine the grade vector 
[2] a which can be used (with CNIDX) to sort the 
[3] aA segments of CNEST into ascending order. The 
[4] aA collating sequence used is in CSEQ. 
[5] N¢CNEST([OIO) 
[6] a Start, lengths: 
C7] SCOIO+CNESTCI+ iN] 
C8] L¢[CNEST(S1]+2 
[9] aA Empty segments precede others in coll. seq.: 
[10] R¢ABeL>0 
[11] A Z:indices into CNEST of remaining elts (i.e. RIA]): 
C12] Z¢B/.N 
[13] a Done if O or 1 segments or all empty: 
[14] 2CC(N>1)AxpZ)10 
[15] A Az:indices into R of remaining elts (always 
[16] aA ascending), BERI]/.UN: 
[17] A€(N-9Z)+teZ 
C18] A 1st col (2 chars as an integer) of data: 
C19] D¢CNEST(S£2Z)]+1I¢1] 
C20] A Reorder grade vec: 
[21] RLAIJ¢ZC0G¢CSEQ4(C CED) ,2)082 ODR DI 
C22} D¢eD[G] 
[23] A Flag lst elts of grps of like values (partition vec): 
[24] P¢D#° 10D 
[25] aA Flag elts of >1 elt grps (map vec): 
[26] LP:M¢PA1OP 
[27] ~>CeAeM/A)10O 
C28] P¢eM/P 
[29] aA Skip following logic if no segments end here: 
[30] »CA/BEL(ZERLIAI]]>I)oeL1 
[31] aA Shift enders to front or end of grps: 
[32] GeaAB 
C33] GeGLla(Te+\P)[G]] 
[34] RCAI€Z(G] 
C35] Be¢B[G] 
[36] D€¢B/T 
[37] P€eD# 10D 
C38] M¢PA1OP 
[39] ~>CpAeM/B/A)L0 
[40] Z¢RLAI 
C41] PeM/P 
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VY CNGRADEUP (continued) 
[42] Li:I¢I+l 


[43] D¢CNESTES(ZI)+T] 
[44] G¢CSEQ4((C0D),2)e82 UDR D 
[45] GeGiAC+\P)[G]] 
C46] RLAIJ€ZCG] 
[47] D€¢DC[G] 
[48] PePvD#710D 
[49]  >LP 
V 


[CWSID: CNFNS] 
V REA CNEO B;3sC:D;3E3:S3:T3sL;K3;2;010 


[11 aA CNEQ is used to compare two character nests of the 
[2] a same length (i.e. number of segments) for matching 
[3] aA segments. The result is a Boolean vector with as 
[4] aA many elements as segments and with a 1 for each 
[5] a segment which is the same in both arguments. A 
[6] aA character vector argument (Cor arguments) is 

[71 a treated as a character nest "scalar" and is 

C8] A compared against all the segments of the other 
[9] A argument. Thus, if both arguments are character 
[10] A vectors, the result will be a scalar 1 if the 
C11] A vectors are equivalent and will be 0O otherwise. 
[12] A Vov,V°S,S°V,So°S: 

[13] O10¢0 

C14] 2%(L1,L2,L3,L4)02182=(ODR A),ODR Bi 

[15] L1:O10¢1 

[16] T€tAl1] 

C17] OLoO¢o 

[18] A Extract starts, lengths: 

[19] S¢AC(T] 

[20] T¢BI[T] 

C21] 2Z¢0#L¢BIT] 

[22] A Compare values when same nonzero lengths: 

[23] E¢ZaAR¢A[S]=L 

[24] Le! CE/L)+2 

[25] S¢E/S 

[26] T¢E/T 

C27] OF0¢1 

C28] A Z¢MONIOTA L: 


[C29] Z¢Z+rvezZeL/-~110,+\L 
[30] S€¢Z+L/S 

[31] T¢Z+L/T 

C32] O10¢0 

[33] A Compare values: 
[34] K¢+\AL[S]=B(T] 

C35] OLfo0¢l 

[36] KeKC+\LI 

[37] A Are they all equal? 
[38] RLE/tpEI]¢L=AK-7~110,K 
[39] 30 

C40] L2:Or0¢1 
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V CNEQ (continued) 
[41] S¢r.1ACl1]J 
C42] O10¢0O 
[43] S¢ATS] 
[44] R¢Al(S]=LeeBe,B 
C45] »C€xL)L0 
[46] O10¢1 
[47] S¢CR/S)o.+v_.Lef L+2 
C48] O1L10¢0 
C49] R¢R\ALSIA.=163 ODRCL+LITB 
[50] 30 
[51] A Reverse SoeV args and use VoS logic: 
[52] L3:T¢eA 


[53] A€¢B 
[54] BeT 
[55] ~L2 
[56] L4:A¢,A 
C57] Be,B 


[58] »CR€CeAJA.=eB)10 
[59] R¢AA.=B 
V 


CNFNS J 


CWSID: 
3G3;1I3;L;LAB;LL:;:M;:S;: 


V INDS<¢CNEST CNIOTA VALS3;A3;B;BASE;C;F 

SHAPE ;SS;cseq 

CNIOTA searches through the character nest left 
argument CNEST for the character nest, vector 
or scalar right argument VALS. The result is 
an integer vector (for nest right arg.) or 
scalar (for character right arg.) of the 
segment index in which the character vector was 
first located (as an exact, not partial, match) 
or 1 greater than the number of segments if not 
found Cala dyadic u). 

Requires subfns: CNGRADEUP,CNCAT,CNIDX,CNEQ. 
Branch if right arg a nest: 

C12] Ce¢CNESTCOTIOI 

C13] 27%C€8240DR VALS)eLl 

[14] A Handle char. vec or scalar right arg: 

C15] L¢eVALS<,VALS 

[16] SHAPE<€toO 

[171 A Length of arg. in integers: 

[18] M¢fL+2 

[19] A Convert arg. to integers: 

C20] VALS¢€163 ODRCM+M)TVALS 

[21] >L5 

[22] L1:A¢VALS[OIO]I 

[23] A Branch unless no segments in either arg: 

[24] 2(xF¢ALC)eL2 

[25] A Handle empty arg: 

[26] INDS+AeDIO 

C27] 20 


_ 
OV 
Ld 
DDDIDIODDIIDDIDIODO 
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V CNIOTA (continued) 


[28] A Branch if both args have more than 1 segment: 


[29] 
[30] 
[31] 
[32] 
[33] 
[34] 
[35] 
[36] 
C37] 
(38] 
[39] 
[40] 
[41] 
[42] 
[43] 
C44] 
C45] 
[46] 
C47] 
C48] 
[49] 
[50] 
C51] 
C52] 
[53] 
[54] 
[55] 
[56] 
[57] 
[58] 
c59] 
[60] 
C61] 
[62] 
C63] 
[64] 
[65] 
[66] 
[67] 
[68] 
[69] 


L2:3(F#1J)0L7 

A Branch unless left arg has 1 segment: 
9*(C#1)eL4 

A Handle 1 segment left arg: 
L¢CNESTC2+010] 

Mel L+2 

A Extract segment as integers: 
CNEST¢CNESTE3+ UM] 
S€COIO+VALS(T1+1tA] 

Be L=VALS([S] 

A Branch if segment empty: 
>(xL)IJL3 

A Indices of same-length segments: 
S€C(€~0I0)+B/S)°0.+1M 

Be B\VALS[{SJA.=CNEST 
L3: INDS<OIO+~B 

>0 

A Handle 1 segment right arg: 
L4:SHAPE¢1 

L€¢VALS(2+0I0) 

M¢f L+2 

VALS¢VALS[3+UM] 
L5:S¢01I0+CNEST([1+1iC] 

A Flag same-length segments in left arg: 
Be L=CNEST(S] 

A Branch if segment empty: 
»9(xL)lL6 

A Indices of same length segments: 
S¢((C*+0I0)+B/S)°0.+ iM 
B¢EB\CNEST[S]A.=VALS 

L6: INDS¢SHAPEeBtl 

>0 


A Branch if sort alg. costs more than looping alg.: 


A Cremove A after replacing C1,C2,C3,C4 by 
A computed constants): 
L7: Ax*C0C4+C5xXL+A)>C1+AxXC2+C3xXL) oLS5 


A Combine args. and sort (like values together): 


cseq¢DAVv 
F¢G¢ODAV CNGRADEUP A€CNEST CNCAT VALS 
A F¢AG: 


FELG]€ieG 

A Flag 1st of distinct rows by shifting and comparing: 
FEeC~A CNEQ A CNIDX( -10G)[F]){£G] 

A Insure ist elt is 1 (in case all rows the same): 
FLOTIOJ€1 

A Indices of 1st distinct rows: 

I¢F/G 

A Replicate for each like row: 

FCOIOIJ€¢O10 

IT¢IC+\FI 
A Unsort indices (to catenated order): 

INDS¢I 


[70] 
C71] 
C72] 
C73] 
C74] 
C75] 
C76] 
C77] 
C78] 
C79] 
C80) 
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[81] 
[C82] 
[83] 
[84] 
[85] 
[86] 
C87] 
C88] 
C89] 
[90] 
[91] 
[92] 
[93] 
[94] 
[95] 
[96] 
C97] 
[98] 


V 


CNIOTA (continued) 

INDS[GJ]¢I 
A Keep those corresponding to right arg: 
INDS¢«ClLINDS 

A Set ‘not found’ inds to ‘one greater’: 
INDS¢INDSLC+O0I0O 

30 

RQ Use looping algorithm if more efficient: 
L8: INDS¢Aoe0 

LAB (AeL9) ,0 
LL€CNEST(CSS<€OIO+CNESTL1+LCNESTCOIOJ) 1 

a Starting indices from which to add uM: 
SS¢SS+~0IO 

I¢O0IO 

L9: BELL=L¢VALSECS€OIO+VALS[1+I1]] 

M¢ef L=2 

B¢B\CNEST(E (CB/SS)°.+ULMI]A.=VALSDE (S+~0I0)+ iM] 
INDSCIJ¢Bi1 

»>LABL IT¢I+1] 


[WSID: NNFNS] 


VY R¢EP NNEST V;sOLO3E;sI;N 


V 


A NNEST is used to convert a numeric vector V into 
A a numeric nest. P is the replication vector used 
A to partition the vector. That is, 

A CeP/0)=Ce,V). If P is a singleton, it is 

A replicated as much as necessary to encompass V. 

A Replicate left arg.: 

OLO<0 


Ve.V 

>(14N¢oP¢,P)eL1 

A Repeat P if singleton: 
Nee P¢CCoeV)+PJeP 

Li: IT¢e+\ 1L0C1+N) ,E€1+P 
A Fill with 2s for now: 
R¢(N,1I),E/2 

A Insert lengths: 

RC IJ¢P 

OLO¢+21 

A E*MONIOTA P: 
ECE+LoE¢P/-"~110,+\P 
OLO0¢0 
A Insert data: 
RLE+P/1TJ¢V 
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C1] 
[2] 
C3] 
C4] 
[5] 
C6] 


C1] 
C2] 
C3] 
C4] 
[5] 
C6] 
C7] 
C8] 


C1] 
C2] 
[3] 
C4] 
(5] 
C6] 
C7] 
C8] 
C9] 


C1] 
C2] 
C3] 
[4] 
C5] 
C6] 
C74] 
C8] 
[9] 


C[WSID: NNFNS] 


V R¢A NNCATSS B:D:F 
A NNCATSS is used to catenate two numeric vectors (i.e. 
A numeric nest "scalars") to form a 2-segment numeric 
A nest. 


DeoAc,A 
FeoBe,B 
R¢€2 3 ,D+4),D,A,F,B 


CWSID: NNFNS] 


VY REA NNCATVS B:D:3:E3;L;U010 
A NNCATVS is used to catenate a numeric nest (A) toa 
A numeric vector (B, i.e. a numeric nest "scalar"). 


L¢oBe,B 

OLO¢+1 

D¢v_LE¢AL[1] 

OLO<0 

D¢A[D] 
R¢EC1it+tE,D,eA),CCI+EJJA).L,B 


CWSID: NNFNS] 


V R¢A NNCATSV B;3D;3E;L;010 
A NNCATSV is used to catenate a numeric vector 
aA CA, 1.e. a numeric nest "scalar") toa 
A numeric nest (B). 


L¢eopAc,A 

DIO¢+1 

D¢LE¢BI[1] 

DLO<O 

D¢eB[D] 

R¢eCCE+ 12 2),0Dt+2+L),L,A),C1I+EJIB 


[WSID: NNFNS] 


V R¢A NNCATVV B:D:E:F:G;010 
A NNCATVV is used to catenate two numeric nests to form 
A a longer numeric nest. 


HNTO¢|1 

D¢LE¢€AL[1] 

F¢€iG¢B[1] 

DIO¢O 

D¢ACD] 

F¢eB(F] 

R¢€C(CG+E,D),F+ 1+9A),CC1l+EVIA),C1+G)1IB 
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CWSID: NNFNS] 


V R¢eA NNCAT B;sD;sE;F;3;G:L;010 


A 
A 
A 


A 


Li: DepAt,A 
FeoBe ,B 
R¢€C2 3 ,D+4),D,A,F,B 
>0 
L2: L¢oBe,B 
OI0¢+1 
De LEC@o0A 
OTO¢0 
D¢AL[D] 
R¢E(1+E,D,oA),(C1+E)JA),L,B 
ae) 
L3:L¢oAe,A 
OTO¢+1 
D<«LE¢60B 
OLO<O0 
D¢BLD] 
RECCE+ 1 2),¢0D+2+L),L,A),C1+E)J IB 
+0 
L4:010¢1 
D¢LE¢Bo0A 
FELGeOoB 
OLO<0 
D¢ACD] 
FeB(F] 
REC (G+E,D),F+ 1+pA),CCL1+EVIA),C€C1+G)1B 
Vv 
CWSID: NNFNS]J] 
V RENEST NNIDX INDS;1;L;35;N 
A NNIDXA is used to replace one or more segments 
A into a numeric nest (NEST). INDS is a 
A numeric vector or scalar of the indices 
A of the segments to be replaced. If a vector, 
A the array replaced (assign) will be a 
A numeric nest. If a scalar, the array will be 
A a numeric vector. The function ASSIGN simply 
A assigns its right argument to the global 
A variable <assign> and returns its left argument. 
A After being replaced, <assign> is erased. 
L¢NESTCOIO+S€¢NEST(1+INDS 1] 
A Branch if nest result: 
+(O#pN¢pINDS)eL1 
R¢NESTE (S+1)4+tb] 
>0 
L1: L€1+L 


NNCAT is used to catenate two numeric nests 
and/or vectors. The non-nest arguments are 
provided as matrices. 

OLO<0 

SeS;VeSs;Sov3sVov: 
>(L1,L2,L3,L4)0(1=99A)+2x1=p0B] 
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[17] 
[18] 
[19] 


C1] 


C1] 

C2] 

to 

C4] 

[5] 

C6] 

C7] 

C8] 

C9] 

C10] 
C11] 
C12] 
[13] 
[14] 
C15] 
C16] 
[17] 
C18] 
[19] 
[20] 
C21] 
[22] 
[23] 
C24] 
[25] 
[26] 
C27] 
[28] 
[29] 
[C30] 
C31) 
[32] 
[33] 
[34] 
[35] 
[36] 
C37] 
C38] 
[39] 


V NNIDX (continued) 
A L<¢MONIOTA L: 
Tel+#lpI€L/-~1l0,4+\L 
REC(N,+\ 1LJC14+N) ,L) ,NESTLI+L/S] 
Vv 


[WSID: NNFNS] 
V R¢eA ASSIGN B 
R¢A © assign¢eB 


v 


[WSID: NNFNS] 
V RENEST NNIDXA INDS3SL3S;D;NL;3;N3;NS;31;3U 

NNIDXA is used to replace one or more segments 
into a numeric nest (NEST). INDS is a 

numeric vector or scalar of the indices of the 
segments to be replaced. If a vector, the 
array replaced (assign) will be a numeric 
nest. If a scalar, the array will be a 
numeric vector. The function ASSIGN simply 
assigns its right argument to the global 
variable <assign> and returns its left argument. 
After being replaced, <assign> is erased. 
N¢NESTLOIO] 

A Branch unless a scalar index: 

+(XpoINDS)oeL2 

L¢NESTCUIO+S¢NEST(C1+INDSI] 

NL¢eD¢,assign 
A Index assign if same length: 

>(L*NL)eL1 

R¢NEST 

R(CS+1)+tLI¢D 

>L5 
L1:Re(S+1)oNEST 

I¢€C€1+INDS+~01I0)+ iN-INDS+~+01I0 

RCLI]J¢+RLIIJ+NL-L 

RIS+OIOJ¢NL 

R¢R,D, (S+L+1)INEST 

>L5 

A Branch if nothing to assign: 
L2:>?CO=eINDS )oL3 
I¢€1+iuN 

R¢.U¢ ( Deassign) [OIO) 
LENESTCOIO+S¢NESTEIIJ] 
NL¢D(LOUIO+NS¢D[1+RI]): 
a Index assign if same 
>*(CLCTINDS]V.#NL)oL4 
R¢NEST 

I¢SCINDS] 

L¢1+NL 
A N€MONIOTA L: 
N¢eNt+LeNeL/-~110,+\L 


DDDDIIDIDDOIODINDND DOD 


length: 
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V NNIDXA Ccontinued) 
[40] RUIN+L/LJ]¢€(U+t1)1D 


[41] >L5 
[42] L3:R¢NEST 
C43] %L5 


[44] L4:ICINDSI<N+R 
[45] Le1+C(L,NLICIT] 
[46] S€(S,NS+oNEST)(T] 
[47] A LTe€MONIOTA L: 
[48] II+ ve IeL/-7110,+\L 
[49] Re(N,+\ 11 01+4+N) ,L),C0NEST,DJCI+L/S] 
[50] L5:OERASE ‘assign’ 
Vv 


{[WSID: NNFNS]J 
V L€ENNLEN NEST;UI0 

[1] aA NNLEN is used to return the lengths of the 
[2] aA segments of the numeric nest right argument. 
C3] OLO<¢+1 
[4] L¢€uNESTE 1] 
[5] OIO<0 
[6] L¢NESTCNESTCECL]] 


LWSID: NNFNS] 

V R¢NEST NNSUMCOL COLS:I:L:N:S 
[1] aA NNSUMCOL is used to sum the Nth column of each 
[2] a M column matrix item (raveled) in the numeric 
C3] A nest NEST. COLS is M,N. 
[4] N¢NESTCOIOI 
[5] S¢NESTL1+iLN]+0I0 
[6] L¢NEST(LS1+COLS[OIO] 
[7] Iel+vpIeL/-7110,+\L 
[8] R¢+\NEST( CL/S+(C~+O010)+COLS[1+010)])+COLS[LOIO]xI-OI0] 
[9] R¢RE C-~0OT0)++4+\LI 
{10] R¢R-"1Jl0,R 
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CURVE FITTING 


The 


The 


Use 


The 


The 


AMT¢60.62 59.57 56.70 60.42 
MAT¢4 4032 61 15 82 35 104 10 82 37 83 5 85 25 62 14 85 
AMTEHMAT 

0.15 0.05 0.73 0.51 


formula is: SUPPLY = A+(BXxTIME) 
supply is exhausted when SUPPLY=0, i.e. when: 
O = At+(BxTIME) 
-A = BxTIME 
TIME = -CA+B) 
AH to determine A and B and then plug into the last equation: 
SUPPLY€1850 1772 1705 1508 1490 1250 
TIME¢1 23569 
C¢SUPPLYH1,[1.5]TIME (C is A,B) 
== /C 
25.6 
supply will be exhausted between weeks 25 and 26. 
formula is: CRA2) = COX-CX)42)4+00CY-CY)*2) 
or: CR*2) = CX*2)4+0CY*2)+0CK*42)+4+0CY%*2)4+0C° 2XXxXCX)I+0 2XYXCY) 
So: CXA2)4+0CY*2) = (CR*2)4+0-CX*2)4+0-CY¥4*2))4+02xXXxXCX)+0C2XxXY¥xCY) 
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Use B8 to determine the coefficients: 


X¢4 3247 6 
Ye1 2352 4 
LEFT¢(X*2)+(CY*2) 
MAT€1,2xX,(f1.51]Y 
C¢LEFTHMAT 
N<+cx¢c[2] 


N<ecy<¢c[3] 
2.9 


Since: C{L1] = (R*2)4+0-CXK*2)+(-CY4*2) 
then: (R*2) = CLLI+CCX*2)+(CY*2) 
SO: 
O<+R€ECCL1LI+0CCX*2)+0CCY*2))*.5 
2.2 


Therefore, the center is (4.6,2.9) and the radius is 2.2. 


4. For: Tl = C4+(€C5x(CR+L)) 


CeT1H1,01.5]R+L 
C4¢Cl1] 
C5¢«C[2] 


For: T2 
2 


C1+(CRx€C2+(CC3xL) )) 
C1+€C2xR)+CC3xXRXxL) 


C¢T2H1,R,01.5)]RxL 
C1¢Ci1] 
C2¢CL[2] 
C3¢C[3] 
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FINANCIAL UTILITIES 


or 


or 


1000x(€1+365 EFFECTIVE .11)*1.5 
1179.36 


1000xX(€1+.11+365)*365x1.5 
1179.36 


tERM¢40+52 
pAY¢10x52 
pER¢52 
PpDEF<¢0O 
GdEF¢-tERM 
CONV€12 
iNTe. 08 
VALUE 
412.84 


VerzC1+.08412)%12 
10x CV*-40452)xC1-V*40+52)4+1-V*=52 


412.84 
PI¢SCHEDULE 4 12 12 012 .14 (48 rows) 
TOTPRIN¢+/PIC[ 31] (Loan amt. at $1 per month) 
PI¢PIx10000+TOTPRIN (Cvt. loan amt. to $10,000) 
+/PIC13] (Monthly pmt.; sum any row) 
273.26 
+/~12TPICE 31] (Repayment amt. is principal 
3043.47 outstanding, i.e. remaining 
principal payments) 
+/36TPI[ 32] (Sum 3 years of interest pmts.) 
2881.01 
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4. DATES€19870101 19870701,101+10000x1991+115 
AMTS€1000x-10 ~5,1503 
DATES IROR AMTS 
0.1058 


C[WSID: INTEREST] 
VY BV€DATE FCBOOK PARAMS ; COUP; CRATE;D;DAYS;F;MDATE;MVAL;N 
*>NCOUPS;P;PAR3;3S3;YLD 


C1] aA Returns book values as of DATE CYYYMMDD), a scalar, 
[2] a for fixed-coupon securities defined in the matrix 
[3] a PARAMS, one row per security. Value is computed 
[4] aA after coupon is received if DATE is a coupon date. 
[5] aA Result has shape: (1Te@PARAMS). 

C6] A 

C7] A PARAMS[;11] par value 

[8] A [32] maturity date (YYYYMMDD) 

[9] A C33] annual coupon rate 

[10] A C34] number of coupons per year 

[11] a [;5] couponly yield rate 

Ci2] aA [;6] Coptional}) maturity value (par value 
C13] aA if omitted) 

[14] a 


[15] PAR¢MVAL¢PARAMSE ;1] 

[16] MDATE*+PARAMSE :2] 

[17] CRATE*¢PARAMSL ;31+NCOUPS¢PARAMSLE ;4] 
[18] YLD¢PARAMS[ ;5] 

[19] »(€5=11loPARAMS) oSTART 

[20] MVAL¢PARAMS[ ;6] 


C21] A 

C22] A Formula: 

[23] A 

[24] A BVCt) = CMVALx(C1+Y)*-WCt))+PARXCRATEX(1-(€1+Y)* 

C25] A -W(T))+Y 

[26] A 

[27] A where: BVCt) = book value at time t (Ca coupon date) 
[28] a Y = couponly yield rate 

C29] A W(t) = the number of (Cwhole) coupon periods 
[30] aA remaining from time t to MDATE 

C31] A 


[32] A Compute approx days (360 days/yr) from specified 
[33] A DATE to maturity (change 31 days to 30): 

(34] START: DAYS€ 360 30 1 +.x 0 100 100 TMDATE-31=100!1MDATE 
C35] DAYS€DAYS- 360 30 1 +.xX 0 100 100 TDATE-31=1001 DATE 
[36] A No. coupon periods from specified date to maturity: 
C37] N«CDAYSXNCOUPS)+360 

C38] A Coupon payment: 

[39] COUP€PARXCRATE 

C40] A Book value at prior coupon: 

[41] DeC1i+YLD)*«-IN 

[42] P¢(MVALXD)+COUPX(1-D)=YLD 
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V FCBOOK (continued) 
[43] A Book value at subsequent coupon: 
[44] DeC1+YLD)*-LN 
[45] S¢C(MVALXD)+COUPx(€(1-D)+YLD 
[46] A Fraction of per. from specified date to next coupon: 
[47] FeN-LN 
[48] aA Interpolate between prior and subsequent book vals: 
[49] BV¢CPxF)+Sx1-F 
V 
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EXCEPTION HANDLING 


1. APL*PLUS: 


V SHOWFILE NAME; LIM;N;0ALX;0ELX 
Ci] OALX<OELX¢ ’ ODM’ 
[2] LIM¢TIEFILE NAME 
C3] OALX<’ ASK’ 
[4] DELX¢'>C''ATTN’’A.=4TODM)/ASK © ODM’ 
[5] ASK:U¢'’BEGIN WITH WHICH RECORD?’ 


SHARP APL: 


APL2: 


VY SHOWFILE NAME:LIM;:N;O0TRAP 
[1] OTRAP€<€'! ’ 
[2] LIM¢TIEFILE NAME 
C3] OTRAP€'V1000 C ®ASK VO C XC''ATTN' 'aA, =4T5I/0ER(OIO; ]) 
/ASK' 
[4] ASK:0¢€'’BEGIN WITH WHICH RECORD?! 
etc. 


Cannot be solved since attentions are not detected as 
exceptions 


CWSID: INPUT] 
VY R¢NUM NXPROMPTE PROMPT; DELX 
Displays character vector PROMPT, allows keyboard 
input on same line and returns numeric vector 
response of length NUM (Cor of any length if NUM=0O). 
Returns numeric scalar escape code if escape word 
entered. Allows and executes primitive APL 
expressions. Requires: CPROMPTE. 
APL*PLUS version. 
SHARP APL: localize OTRAP instead of OELX. 
APL2: localize neither DELX or OTRAP. 
[10] DOELX<’ODM’ aA APL*PLUS 
[11] A OTRAP<’'’ A SHARP APL 
[12] Li:R¢CPROMPTE PROMPT 


| om | 

O1 

bend 
DDDODIDIIDIVIOO 
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3. 


C13] 
[14] 
C15] 
[16] 


C17] 
[18] 
C19] 
[20] 
C21] 
[22] 
[23] 
[24] 
[25] 
[26] 
[27] 
[28] 
[29] 
[30] 
[31] 
C32] 
[33] 
[34] 


C35] 
C36] 


C37] 
C38] 
[39] 


C1] 
C2] 
C3] 
C4] 
[5] 
C6] 
C7] 
C8] 
[9] 


EXCEPTION HANDLING 


VY NXPROMPTE (continued) 


A 


A 


A 


A 


A 


A 
A 


OELX¢’>L4' 


Exit if scalar escape code: 


>CpeR)10 


Branch if any letters (but E) in response: 


»(vV/RE' ABCDFGHIJKLMNOPORSTUVWXYZAabcdefghijklmnopqrstu 
vwxyZA')/L2 


Branch if any E not used in exponential notation: 


9(V/CR='E!’ JA 7111,~RE' 01234567890! )/L2 


Execute expression and ravel (trapping error): 
A APL*PLUS 


OTRAP«’VO E »L4' A SHARP APL 


R¢e,@R A APL*PLUS or SHARP APL 


'»L4’ OEA 'Re,',R A APL2 
Check that result is numeric: 


»(0=1T0eR)/L3 


L2:O0¢'** ENTER NUMBERS OR APL EXPRESSIONS ONLY xx! 


A 


>L1 


Exit if NUM is 0 or is length of input: 


L3:?>NUMLO 


A 


A 
A 


A 


Vv 


+ (NUM=eR)/0 
O«'s** ENTER ',(3NUM),’ NUMBER’,(NUM=1)1'S xx! 
>L1 
L4:0¢’'s* INVALID EXPRESSION CAUSING: 


',ODM A APL*PLUS 
L4:0¢'*x* INVALID EXPRESSION CAUSING: ',5/0ERCOIO;] a 


SHARP APL 


O<+1 OLOUER A SHARP APL 


L4:0¢’*x* INVALID EXPRESSION CAUSING: ' ,.gQEMCOUIO;] a 


APL2 


O¢1 OLOEM a APL2 


he’? 
>L1 


Below are the SHARP APL (CERRATTNS) and APL*PLUS CERRATTNP) 
solutions to the problen. 
because attentions are not considered exceptions (to be handled) 
and because the OEA and DEC system functions are not designed to 
allow the type of "environment conditioning" required in the 

problem. 


The problem cannot be solved with APL2 


CWSID: ERROR] 


V ELX ERRATTNS ALX 


DDDDIDIDDDDND 


SHARP APL version. 

Sets OTRAP so that error will cause »ELX at 
the SI level at which OTRAP is local: and 
break will cause »ALX at that level. If ELX 
or ALX is a character vector, it is executed 
at the level at which the error ar attention 
occurs. If ELX or ALX is empty. no special 
error or attention handling is included in 
the local OTRAP. Be sure to localize OTRAP 
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V ERRATTNS (continued) 
[10] a in the calling function. 
[111 QTRAP€!?! 
[12] »(O€pELX) pL2 
[13] A Branch if numeric ELX: 
[14] »9CO=1TOPELX) eL1 
C15] QOTRAP€’V O EF ' ,ELX 
[16] %L2 
[17] L1l:OTRAP€’V OC %’ ,tELX 
[18] L2:>(0€pALX) 90 
[19] A Branch if numeric ALX: 
[20] 7(0=1TOPALX) eL3 
21] OTRAP¢COTRAP,’V 1000 E ° ,ALX 
[22] 20 
C23] L3:OTRAPCOTRAP,'’V 1000 C >’ ,éALX 
V 


CWSID: ERROR] 
V ELX ERRATTNP ALX 


C1] A APL*PLUS version. 

[2] a Sets DELX and possibly OALX so that error 
[3] a will cause »ELX at the SI level at which 

[4] ea OELX is local; and break will cause »ALX 

[5] a at the SI level at which DALX is local (uses 
[6] A OERROR to percolate to proper level). If 
[7] aA ELX or ALX is a character vector, it is 

[8] aA executed at the level at which the error or 
[9] a attention occurs. If ELX is empty, OELX is 
C10] A set to 'ODM’. If ALX is empty, OALX is set 
[11] a to 'ODM’ if DALX is localized in the calling 
[12] aA function, and otherwise is not modified. Be 
[13] A sure to localize OELX in the calling function. 
[14] a Localize DALX only if ALX is not empty. 


[15] »CO€pALX)eL2 

[16] A Branch if numeric ALX: 
[17] 2»CO=1TOPALX)eL1 

C18] A Character ALX: 

[19] DALX<ALX 


[20] >2L3 

[21] A Numeric ALX: 

[22] L1:0ALX¢’¢(~1=1eDIDLOC’ 'OALX'')/''QERROR’!'’'ATTN''''''o 
>' ,¢ALX 

[23]  %L3 


C24] A Empty ALX; branch if OALX not local to calling fn: 
[25] L2:%C 1=1T1l,Q0IDLOC 'OQALX’ JeL3 

C26] OALX¢’ODM’' 

[27] L3:%(0€eELX)IL4 

[28] ELX¢’ODM’ 

C29] A Branch if numeric ELX: 

[30] L4:39(0=1TOpELX) 0 L6 

[31] A Branch if non-empty numeric ALX: 

[C32] 27CCO#I1TOPALX) VOEPALX)I ILS 
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V ERRATTNP (continued) 
[33] A Character ELX; character or empty ALX: 
[34] OELX¢ELX 
[35] 0 
[36] A Character ELX; numeric ALX: 
[37] L5:0ELX¢’?eCC 14#1e0IDLOC' 'OALX'')A’'ATTN’ 'A.=4TODM)/''>' 
» ($ALX),''’ 'OOERROR(’ /ATTN’ 'a.=4TODM)/4TODMO' ,ELX 
[38] +0 
[39] L6:>CCOZF1TOPALX) VOEPALX)ILL7 
[40] A Numeric ELX; character or empty ALxX: 
[41] OELX¢’e@C"1=l1e0IDLOC’ 'OELXK'’)/’' 'OERRORCA\ODMZOTCNL) /ODM 
''O9' , SELX 
C42] 30 
[43] A Numeric ELX; numeric ALX: 
[44] L7:0ELX¢’e@C"1=1e0IDLOC’ 'OELX’')/' 'NERRORCA\ODMZOTCNL) / 
ODM'’OsC' ATTN’ 'A.=4TODM)O’ ,dC1T, ELX),1T,ALX 
V 
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INDEX 


accumulation 92 
acknowledgements 3 
ADDEMP (wsid: MSF) 
age computation 139 
DAI, accounting information 23 


177 


322 
ONALX, attention latent 
expression 308 
amortization 302 
ANDRED (Cwsid: REDUCE) 344 
annuity 285 
APL files 203 
APL2 1 


APL*PLUS 1 


ASSIGN Cwsid: CNFNS) 463 
ASSIGN (wsid: MULTIFLO) 423 
ASSIGN (wsid: NNFNS) 472 


308 
154 


attention detection 
attention handling 
autostarting 185 


DAV, atomic vector 86 151 245 260 


BDAM 447 
best fit 278 
blank manipulation 40 


bond accrued interest 144 


bond calculations 297 
book value 301 
Boolean storage 34 105 


Boolean techniques 105 230 244 
BOOLEAN ws 316 
branching 8 172 


BY (wsid: SEARCH) 76 


C- “37 
canonical representation 26 189 
CASHBAL ws 316 


CASHBAL) 16 
CASHBAL) 17 


CASH1 (Cwsid: 
CASH2 (wsid: 


CASH3 (wsid: CASHBAL) 18 


CASH4 (wsid: CASHBAL) 325 
CATEMP (wsid: MSF) 178 
CATREC 216 

CATREC (wsid: MULTIFLO) 392 
CATRECWS 216 


CATRECWS (wsid: MULTIFLO) 396 


CENTER (wsid: FORMAT) 330 
centering 50 

CGRADEUP 86 267 

CGRADEUP (Cwsid: SORT) 60 
CGRADEUP1 Cwsid: SORT) 57 
CGRADEUP2 (wsid: SORT) 59 
character data positioning 40 


character matrix searching 64 

character matrix sorting 56 

character nest 254 

character substring replacement 
76 

character substring searching 75 

CINPUT (Cwsid: MSF) 178 

circle 280 

CJUST (Cwsid: FORMAT) 42 

classification 92 

CMIOTA 39 85 280 

CMIOTA (wsid: SEARCH) 69 

CMIOTA1 (wsid: SEARCH) 65 

CMIOTA2 (wsid: SEARCH) 66 


CNCAT (wsid: CNFNS) 462 
CNEQ (wsid: CNFNS) 466 
CNEST Cwsid: CNFNS) 459 
CNFNS ws 316 

CNGRADEUP (wsid: CNFNS) 465 
CNIDX (Cwsid: CNFNS) 463 
CNIDXA Cwsid: CNFNS) 463 
CNIOTA Cwsid: CNFNS) 467 
CNLEN (wsid: CNFNS) 465 
CNAM (wsid: CNFNS) 460 
CNAV Cwsid: CNFNS) 461 


COBOL 37 157 
coding 162 170 
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collating sequence 56 

COLUMNIZE (wsid: FORMAT) 331 

comments 171 

COMMENTS ws 316 

compiled functions 37 

compiler 19 37 

compounding 283 

COMPRESS 218 

COMPRESS (wsid: MULTIFLO) 409 

compressed Milky-Way array 112 

conditional branching 8 

CONTINUE workspace 186 

continuous payments 287 

COST (wsid: TIMING) 327 

CPROMPT (wsid: INPUT) 119 

CPROMPTE (wsid: INPUT) 124 

CPU time 23 

OCR, canonical representation 
189 245 

cross-tabulation 92 

CRTIMING ws 316 

CRAVR 193 198 245 

CRAVR (wsid: FNREP) 190 

OCT, comparison tolerance 
65 72 449 

curve fitting 271 


59 64 


data storage 34 

date formatting 133 

date manipulation 135 

date representation 129 
date utility functions 136 
date validation 131 

DATES ws 317 

day of week 144 

DEB (wsid: FORMAT) 41 

DEB (wsid: SEARCH) 336 
ODEF, define 26 198 

DELEMP (wsid: MSF) 178 
deletion flag 207 227 
delivery 163 174 

DELREC 218 

DELREC (wsid: MULTIFLO) 407 
diagonal 82 89 

diamond 24 

diamonds 29 

directory files 205 
distinct values 84 

DLB (wsid: FORMAT) 41 

ODM, diagnostic message 304 
ODR, data representation 260 
DROPFN (wsid: FNSFILE) 385 
DTB 256 


INDEX 


DTB (Cwsid: FORMAT) 41 


OBA, execute alternate 121 305 
each 12 

DEC, execute controlled 306 
OFC, error conditioning 307 
EFFECTIVE (wsid: INTEREST) 284 
efficiency 23 33 

DELX, error latent expression 304 
ORM, event message 305 
embedded assignment 55 

EMMA 216 

EMPLOYEES (wsid: FLF) 176 
EMPLOYEES (wsid: MSF) 178 
EMPLOYEES (wsid: MULTIFLO) 390 
enclose 255 

OER, event report 305 

ERRATTNP (wsid: ERROR) 481 
ERRATTNS (wsid: ERROR) 480 
QOERROR 307 

error detection 304 

error messages 173 

error Signalling 307 

ERROR ws 317 

escape from input 123 

ESCAPE (wsid: INPUT) 125 

DES, event simulation 308 

OET, event type 308 

evaluated input 120 

exception handling 303 

execute 12 25 34 119 

EXECUTE 109 220 

EXECUTE (wsid: MULTIFLO) 436 


familiarization 158 165 

FCBOOK (wsid: INTEREST) 477 

FCYIELD (wsid: INTEREST) 298 

OFD, function definition 26 188 
198 245 

OFI, fix input 120 

fields 215 

file block size 226 

file design 160 166 203 208 

file documentation 212 

file efficiency 210 

File Manager 216 

file organization 204 

file utilities 203 215 225 

FILEDOC 214 

FILEDOC ws 317 

FILEDOC (wsid: FILEDOC) 386 

files, inverted 210 
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INDEX 


files, layered 211 
financial utilities 
first difference 95 


282 


FLF 180 

FLF ws 317 

floating point storage 34 59 103 
floppy disk 316 

flowcharting 162 169 

OFMT, format 133 135 145 151 


FNCREATE (wsid: FNSFILE) 
FNIDS ws 317 

FNREP ws 317 
FNSFILE ws 317 

FOR 221 

FOR (wsid: MULTIFLO) 
forecasting 274 
format by example 
FORMAT ws 317 
formatting by row 50 
formatting with newlines 
FORTRAN 157 

frequency counts 92 
FROMDAYS (wsid: DATES) 
FROMDAYSA (Cwsid: DATES) 
FROMDAYS360 (wsid: DATES) 
FROMMDY (wsid: DATES) 139 
FROMMDYA (Cwsid: DATES) 140 
FROMOTS (wsid: DATES) 143 
FROMYD (wsid: DATES) 365 
function documentation 188 
function identifiers 194 
function size 182 
functions on file 
future value 285 
OFX, fix 26 198 309 311 


383 


436 


133 135 145 152 


151 


141 
142 
143 


228 


GETFN (wsid: FNSFILE) 385 


global changes 181 
global passing 183 
global variables 170 


grade vector 53 


grade-up, dyadic 56 63 67 86 96 


267 
HEADINGS 148 
HEADINGS (wsid: FORMAT) 332 
heterogeneous arrays 259 
IDENTIFY 194 
IDENTIFY C(wsid: FNIDS) 451 


identity element 30 101 104 321 


identifier localization 
312 

IF C(wsid: INPUT) 
IF (wsid: MSF) 
INDEX 218 

INDEX (Cwsid: MULTIFLO) 
INDEXA 219 
INDEXA (wsid: 
INDEXWS 218 
INDEXWS (wsid: MULTIFLO) 
INDEXWSA 220 

INDEXWSA (wsid: MULTIFLO) 
INITFILE 216 

INITFILE (Cwsid: MULTIFLO) 
inner product 67 94 
input validation 122 
INPUT ws 317 
integer storage 
interactive functions 
INTEREST ws 317 
interest, effective 
interest, force of 
interest, nominal 
introduction 1 
IOTA 217 

IOTA Cwsid: MULTIFLO) 
IOTA (Cwsid: SEARCH) 
TOTARHO 217 

IOTARHO (wsid: MULTIFLO) 
lota, dyadic 65 85 
jota, monadic 47 
IPDATEMDY (Cwsid: DATES) 
TROR 294 
TROR (wsid: 
irregular arrays 


OIDLOC, 


125 
179 


412 

MULTIFLO) 423 
413 
429 


391 


34 59 259 
116 


282 


283 
283 


401 
335 


403 


132 


INTEREST ) 
293 


296 


J vectors 37 
Julian dates 
justification 


144 
42 47 148 


keywords 167 


label vector 11 
labels 172 

LAYERS 221 

LAYERS (wsid: MULTIFLO) 
OLC, line counter 
leap years 138 
least squares 
limbering up 4 
link 256 


445 
31 187 309 


273 
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LIOTA (wsid: SEARCH) 71 

LIOTA1 (wsid: SEARCH) 74 
LISTEMP (wsid: MSF) 179 

LJUST (Cwsid: FORMAT) 42 

loan amortization schedule 290 
LOCALIZE 197 

LOCALIZE (wsid: FNIDS) 455 
logical partition operations 237 
logical reductions 233 

logical scalar functions 231 
logical scans 234 

loop avoidance 15 

LOOP ws 317 

LOOPI (wsid: LOOP) 14 

looping 8 11 36 172 

looping primitive 13 

lower limits 72 

LPROMPTE (wsid: INPUT) 362 
OLX, latent expression 168 170 


major-to-minor sort 54 

map operations 236 

maps 234 

matrix nest 254 

matrix sorting 55 

maximum scan 101 

MAXRED (wsid: REDUCE) 342 

mean squared error 273 278 

MESSAGE (wsid: INPUT) 125 

MESSAGE (wsid: MSF) 179 

OMF, monitor facility 33 

Milky-Way reduction 111 

Milky-Way reduction on files 113 

MINRED (wsid: REDUCE) 343 

MONIOTA (wsid: UTILITY) 48 

mortality tables 90 

MSF 180 

MSF ws 318 

MULTIFLO ws 318 

MULTISA ws 318 

MULTI2 ws 318 

multi-set transposed files 206 
222 

multi-target branching 9 


naming conventions 171 
nested array emulation 257 
nested arrays 253 
Newton-Raphson Method 295 
NEXTI Cwsid: LOOP) 15 
NINPUT (Cwsid: INPUT) 121 
NINPUT (wsid: MSF) 179 


INDEX 


NINPUT2 (wsid: INPUT) 122 
ONL, name list 188 

NNCAT (wsid: NNFNS) 471 
NNCATSS (wsid: NNFNS) 470 
NNCATSV (wsid: NNFNS) 470 
NNCATVS (wsid: NNFNS) 470 
NNCATVV (Cwsid: NNFNS) 470 
NNEST (Cwsid: NNFNS) 469 
NNFNS ws 318 

NNIDX Cwsid: NNFNS) 471 
NNIDXA (wsid: NNFNS) 472 
NNLEN (wsid: NNFNS) 473 
NNSUMCOL (wsid: NNFNS) 473 
NOMINAL (Cwsid: INTEREST) 285 
nonlinear curve fitting 277 
NPROMPTE (Cwsid: INPUT) 126 
NPROMPTE2 (wsid: INPUT) 127 
NRECARECL 447 

numeric nest 261 

NXPROMPTE (wsid: INPUT) 479 
n-way logical reduction 104 
n-way max/min reduction 100 
n-way plus reduction 97 
n-way reduction on files 109 
n-way reductions 93 


OBFUSCATE 367 

origin 173 

ORRED (wsid: REDUCE) 345 
outer product 70 94 
output 172 


packing columns 57 

pPANDMAP (wsid: BOOLEAN) 244 
PANDRED (wsid: BOOLEAN) 240 
PANDSCAN (wsid: BOOLEAN) 240 
partition utilities 238 
partition vector 238 258 
PEQMAP (wsid: BOOLEAN) 243 
PEQSCAN (wsid: BOOLEAN) 242 
perpetuity 287 

PGEMAP (wsid: BOOLEAN) 243 
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